Dataset statistics
| Number of variables | 39 |
|---|---|
| Number of observations | 74250 |
| Missing cells | 58358 |
| Missing cells (%) | 2.0% |
| Duplicate rows | 56 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 22.7 MiB |
| Average record size in memory | 320.0 B |
Variable types
| Numeric | 9 |
|---|---|
| DateTime | 1 |
| Text | 7 |
| Categorical | 20 |
| Boolean | 2 |
recorded_by has constant value "GeoData Consultants Ltd" | Constant |
| Dataset has 56 (0.1%) duplicate rows | Duplicates |
public_meeting is highly imbalanced (56.2%) | Imbalance |
management_group is highly imbalanced (69.1%) | Imbalance |
water_quality is highly imbalanced (71.3%) | Imbalance |
quality_group is highly imbalanced (67.9%) | Imbalance |
funder has 4507 (6.1%) missing values | Missing |
installer has 4532 (6.1%) missing values | Missing |
public_meeting has 4155 (5.6%) missing values | Missing |
scheme_management has 4847 (6.5%) missing values | Missing |
scheme_name has 36052 (48.6%) missing values | Missing |
permit has 3793 (5.1%) missing values | Missing |
amount_tsh is highly skewed (γ1 = 56.37002144) | Skewed |
num_private is highly skewed (γ1 = 91.3269825) | Skewed |
amount_tsh has 52049 (70.1%) zeros | Zeros |
gps_height has 25649 (34.5%) zeros | Zeros |
longitude has 2269 (3.1%) zeros | Zeros |
num_private has 73299 (98.7%) zeros | Zeros |
population has 26834 (36.1%) zeros | Zeros |
construction_year has 25969 (35.0%) zeros | Zeros |
Reproduction
| Analysis started | 2024-08-31 23:07:54.698297 |
|---|---|
| Analysis finished | 2024-08-31 23:08:16.243264 |
| Duration | 21.54 seconds |
| Software version | ydata-profiling v4.8.3 |
| Download configuration | config.json |
amount_tsh
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 102 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 318.6857 |
| Minimum | 0 |
|---|---|
| Maximum | 350000 |
| Zeros | 52049 |
| Zeros (%) | 70.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 20 |
| 95-th percentile | 1200 |
| Maximum | 350000 |
| Range | 350000 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 2906.7624 |
|---|---|
| Coefficient of variation (CV) | 9.1210943 |
| Kurtosis | 4766.5651 |
| Mean | 318.6857 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 56.370021 |
| Sum | 23662414 |
| Variance | 8449267.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 52049 | |
| 500 | 3874 | 5.2% |
| 50 | 3103 | 4.2% |
| 1000 | 1858 | 2.5% |
| 20 | 1812 | 2.4% |
| 200 | 1516 | 2.0% |
| 100 | 1034 | 1.4% |
| 10 | 995 | 1.3% |
| 30 | 929 | 1.3% |
| 2000 | 882 | 1.2% |
| Other values (92) | 6198 | 8.3% |
| Value | Count | Frequency (%) |
| 0 | 52049 | |
| 0.2 | 4 | < 0.1% |
| 0.25 | 1 | < 0.1% |
| 0.5 | 1 | < 0.1% |
| 1 | 3 | < 0.1% |
| 2 | 18 | < 0.1% |
| 3 | 1 | < 0.1% |
| 5 | 471 | 0.6% |
| 6 | 231 | 0.3% |
| 7 | 87 | 0.1% |
| Value | Count | Frequency (%) |
| 350000 | 1 | < 0.1% |
| 250000 | 1 | < 0.1% |
| 200000 | 2 | < 0.1% |
| 170000 | 1 | < 0.1% |
| 138000 | 1 | < 0.1% |
| 120000 | 1 | < 0.1% |
| 117000 | 7 | |
| 100000 | 4 | |
| 70000 | 2 | < 0.1% |
| 60000 | 2 | < 0.1% |
date_recorded
Date
| Distinct | 369 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| Minimum | 2001-03-26 00:00:00 |
|---|---|
| Maximum | 2013-12-03 00:00:00 |
funder
Text
MISSING 
| Distinct | 2139 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 4507 |
| Missing (%) | 6.1% |
| Memory size | 1.1 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 27 |
| Mean length | 9.916264 |
| Min length | 1 |
Characters and Unicode
| Total characters | 691590 |
|---|---|
| Distinct characters | 70 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1129 ? |
|---|---|
| Unique (%) | 1.6% |
Sample
| 1st row | Roman |
|---|---|
| 2nd row | Grumeti |
| 3rd row | Lottery Club |
| 4th row | Unicef |
| 5th row | Action In A |
| Value | Count | Frequency (%) |
| of | 12116 | 10.7% |
| government | 11536 | 10.2% |
| tanzania | 11406 | 10.1% |
| danida | 3921 | 3.5% |
| world | 3501 | 3.1% |
| water | 3303 | 2.9% |
| hesawa | 2783 | 2.5% |
| bank | 1790 | 1.6% |
| kkkt | 1732 | 1.5% |
| rwssp | 1705 | 1.5% |
| Other values (2305) | 59055 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 85335 | 12.3% |
| n | 72148 | 10.4% |
| i | 47485 | 6.9% |
| e | 46776 | 6.8% |
| 43186 | 6.2% | |
| r | 34873 | 5.0% |
| t | 28714 | 4.2% |
| o | 28372 | 4.1% |
| s | 21436 | 3.1% |
| d | 19386 | 2.8% |
| Other values (60) | 263879 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 691590 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 85335 | 12.3% |
| n | 72148 | 10.4% |
| i | 47485 | 6.9% |
| e | 46776 | 6.8% |
| 43186 | 6.2% | |
| r | 34873 | 5.0% |
| t | 28714 | 4.2% |
| o | 28372 | 4.1% |
| s | 21436 | 3.1% |
| d | 19386 | 2.8% |
| Other values (60) | 263879 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 691590 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 85335 | 12.3% |
| n | 72148 | 10.4% |
| i | 47485 | 6.9% |
| e | 46776 | 6.8% |
| 43186 | 6.2% | |
| r | 34873 | 5.0% |
| t | 28714 | 4.2% |
| o | 28372 | 4.1% |
| s | 21436 | 3.1% |
| d | 19386 | 2.8% |
| Other values (60) | 263879 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 691590 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 85335 | 12.3% |
| n | 72148 | 10.4% |
| i | 47485 | 6.9% |
| e | 46776 | 6.8% |
| 43186 | 6.2% | |
| r | 34873 | 5.0% |
| t | 28714 | 4.2% |
| o | 28372 | 4.1% |
| s | 21436 | 3.1% |
| d | 19386 | 2.8% |
| Other values (60) | 263879 |
gps_height
Real number (ℝ)
ZEROS 
| Distinct | 2456 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 665.66731 |
| Minimum | -90 |
|---|---|
| Maximum | 2777 |
| Zeros | 25649 |
| Zeros (%) | 34.5% |
| Negative | 1881 |
| Negative (%) | 2.5% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | -90 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 364 |
| Q3 | 1317 |
| 95-th percentile | 1796 |
| Maximum | 2777 |
| Range | 2867 |
| Interquartile range (IQR) | 1317 |
Descriptive statistics
| Standard deviation | 692.76103 |
|---|---|
| Coefficient of variation (CV) | 1.0407016 |
| Kurtosis | -1.2860423 |
| Mean | 665.66731 |
| Median Absolute Deviation (MAD) | 364 |
| Skewness | 0.46929439 |
| Sum | 49425798 |
| Variance | 479917.85 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 25649 | |
| -16 | 71 | 0.1% |
| -15 | 69 | 0.1% |
| -13 | 68 | 0.1% |
| -19 | 65 | 0.1% |
| -14 | 64 | 0.1% |
| 1290 | 60 | 0.1% |
| -18 | 60 | 0.1% |
| 303 | 59 | 0.1% |
| -20 | 58 | 0.1% |
| Other values (2446) | 48027 |
| Value | Count | Frequency (%) |
| -90 | 1 | < 0.1% |
| -63 | 2 | |
| -59 | 1 | < 0.1% |
| -57 | 2 | |
| -56 | 1 | < 0.1% |
| -55 | 1 | < 0.1% |
| -54 | 1 | < 0.1% |
| -53 | 1 | < 0.1% |
| -52 | 2 | |
| -51 | 3 |
| Value | Count | Frequency (%) |
| 2777 | 1 | |
| 2770 | 1 | |
| 2628 | 1 | |
| 2627 | 1 | |
| 2626 | 2 | |
| 2623 | 1 | |
| 2614 | 1 | |
| 2585 | 1 | |
| 2576 | 2 | |
| 2569 | 1 |
installer
Text
MISSING 
| Distinct | 2410 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 4532 |
| Missing (%) | 6.1% |
| Memory size | 1.1 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 29 |
| Mean length | 6.0973063 |
| Min length | 1 |
Characters and Unicode
| Total characters | 425092 |
|---|---|
| Distinct characters | 71 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1245 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | Roman |
|---|---|
| 2nd row | GRUMETI |
| 3rd row | World vision |
| 4th row | UNICEF |
| 5th row | Artisan |
| Value | Count | Frequency (%) |
| dwe | 22004 | |
| government | 3450 | 4.0% |
| water | 2301 | 2.7% |
| hesawa | 1768 | 2.1% |
| rwe | 1526 | 1.8% |
| district | 1491 | 1.7% |
| kkkt | 1445 | 1.7% |
| council | 1356 | 1.6% |
| commu | 1354 | 1.6% |
| danida | 1307 | 1.5% |
| Other values (2191) | 47268 |
Most occurring characters
| Value | Count | Frequency (%) |
| D | 34447 | 8.1% |
| W | 32323 | 7.6% |
| E | 31711 | 7.5% |
| a | 21693 | 5.1% |
| n | 20670 | 4.9% |
| e | 19282 | 4.5% |
| i | 18760 | 4.4% |
| A | 17012 | 4.0% |
| r | 16604 | 3.9% |
| t | 15918 | 3.7% |
| Other values (61) | 196672 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 425092 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| D | 34447 | 8.1% |
| W | 32323 | 7.6% |
| E | 31711 | 7.5% |
| a | 21693 | 5.1% |
| n | 20670 | 4.9% |
| e | 19282 | 4.5% |
| i | 18760 | 4.4% |
| A | 17012 | 4.0% |
| r | 16604 | 3.9% |
| t | 15918 | 3.7% |
| Other values (61) | 196672 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 425092 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| D | 34447 | 8.1% |
| W | 32323 | 7.6% |
| E | 31711 | 7.5% |
| a | 21693 | 5.1% |
| n | 20670 | 4.9% |
| e | 19282 | 4.5% |
| i | 18760 | 4.4% |
| A | 17012 | 4.0% |
| r | 16604 | 3.9% |
| t | 15918 | 3.7% |
| Other values (61) | 196672 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 425092 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| D | 34447 | 8.1% |
| W | 32323 | 7.6% |
| E | 31711 | 7.5% |
| a | 21693 | 5.1% |
| n | 20670 | 4.9% |
| e | 19282 | 4.5% |
| i | 18760 | 4.4% |
| A | 17012 | 4.0% |
| r | 16604 | 3.9% |
| t | 15918 | 3.7% |
| Other values (61) | 196672 |
longitude
Real number (ℝ)
ZEROS 
| Distinct | 71870 |
|---|---|
| Distinct (%) | 96.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.074262 |
| Minimum | 0 |
|---|---|
| Maximum | 40.345193 |
| Zeros | 2269 |
| Zeros (%) | 3.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 30.043194 |
| Q1 | 33.086819 |
| median | 34.907475 |
| Q3 | 37.181685 |
| 95-th percentile | 39.13025 |
| Maximum | 40.345193 |
| Range | 40.345193 |
| Interquartile range (IQR) | 4.094866 |
Descriptive statistics
| Standard deviation | 6.5725188 |
|---|---|
| Coefficient of variation (CV) | 0.19288807 |
| Kurtosis | 19.148748 |
| Mean | 34.074262 |
| Median Absolute Deviation (MAD) | 2.0389258 |
| Skewness | -4.187363 |
| Sum | 2530014 |
| Variance | 43.198004 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2269 | 3.1% |
| 37.5320952 | 2 | < 0.1% |
| 32.99387218 | 2 | < 0.1% |
| 38.34050134 | 2 | < 0.1% |
| 37.54080503 | 2 | < 0.1% |
| 32.9936827 | 2 | < 0.1% |
| 32.9780624 | 2 | < 0.1% |
| 39.10375198 | 2 | < 0.1% |
| 39.09206155 | 2 | < 0.1% |
| 39.08843697 | 2 | < 0.1% |
| Other values (71860) | 71963 |
| Value | Count | Frequency (%) |
| 0 | 2269 | |
| 29.6071219 | 1 | < 0.1% |
| 29.60720109 | 1 | < 0.1% |
| 29.61032056 | 1 | < 0.1% |
| 29.61096482 | 1 | < 0.1% |
| 29.61194674 | 1 | < 0.1% |
| 29.61250689 | 1 | < 0.1% |
| 29.61276296 | 1 | < 0.1% |
| 29.61277618 | 1 | < 0.1% |
| 29.61344309 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 40.34519307 | 1 | |
| 40.34430089 | 1 | |
| 40.32523996 | 1 | |
| 40.32522643 | 1 | |
| 40.32501564 | 1 | |
| 40.32340181 | 1 | |
| 40.32283237 | 1 | |
| 40.32280453 | 1 | |
| 40.3226251 | 1 | |
| 40.32216902 | 1 |
latitude
Real number (ℝ)
| Distinct | 71869 |
|---|---|
| Distinct (%) | 96.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -5.701771 |
| Minimum | -11.64944 |
|---|---|
| Maximum | -2 × 10-8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 74250 |
| Negative (%) | 100.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | -11.64944 |
|---|---|
| 5-th percentile | -10.586484 |
| Q1 | -8.525675 |
| median | -5.0265399 |
| Q3 | -3.3250579 |
| 95-th percentile | -1.4081268 |
| Maximum | -2 × 10-8 |
| Range | 11.64944 |
| Interquartile range (IQR) | 5.2006171 |
Descriptive statistics
| Standard deviation | 2.9449691 |
|---|---|
| Coefficient of variation (CV) | -0.51650077 |
| Kurtosis | -1.0542077 |
| Mean | -5.701771 |
| Median Absolute Deviation (MAD) | 2.0688622 |
| Skewness | -0.15288081 |
| Sum | -423356.5 |
| Variance | 8.6728431 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -2 × 10-8 | 2269 | 3.1% |
| -2.49645868 | 2 | < 0.1% |
| -7.05637235 | 2 | < 0.1% |
| -6.98602609 | 2 | < 0.1% |
| -6.95674564 | 2 | < 0.1% |
| -2.49454559 | 2 | < 0.1% |
| -2.490689 | 2 | < 0.1% |
| -2.51661892 | 2 | < 0.1% |
| -2.48004347 | 2 | < 0.1% |
| -7.17908174 | 2 | < 0.1% |
| Other values (71859) | 71963 |
| Value | Count | Frequency (%) |
| -11.64944018 | 1 | |
| -11.64837759 | 1 | |
| -11.58629656 | 1 | |
| -11.56857679 | 1 | |
| -11.56680457 | 1 | |
| -11.56459195 | 1 | |
| -11.56450865 | 1 | |
| -11.56432357 | 1 | |
| -11.56231592 | 1 | |
| -11.56228898 | 1 |
| Value | Count | Frequency (%) |
| -2 × 10-8 | 2269 | |
| -0.99846435 | 1 | < 0.1% |
| -0.99875229 | 1 | < 0.1% |
| -0.998916 | 1 | < 0.1% |
| -0.99901209 | 1 | < 0.1% |
| -0.99911702 | 1 | < 0.1% |
| -0.9994692 | 1 | < 0.1% |
| -0.99950651 | 1 | < 0.1% |
| -0.99952232 | 1 | < 0.1% |
| -1.00058519 | 1 | < 0.1% |
wpt_name
Text
| Distinct | 45683 |
|---|---|
| Distinct (%) | 61.5% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 1.1 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 25 |
| Mean length | 10.977171 |
| Min length | 1 |
Characters and Unicode
| Total characters | 815033 |
|---|---|
| Distinct characters | 76 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 39882 ? |
|---|---|
| Unique (%) | 53.7% |
Sample
| 1st row | none |
|---|---|
| 2nd row | Zahanati |
| 3rd row | Kwa Mahundi |
| 4th row | Zahanati Ya Nanyumbu |
| 5th row | Shuleni |
| Value | Count | Frequency (%) |
| kwa | 26774 | 19.6% |
| none | 4440 | 3.2% |
| mzee | 4264 | 3.1% |
| shuleni | 2696 | 2.0% |
| ya | 1865 | 1.4% |
| shule | 1755 | 1.3% |
| school | 1403 | 1.0% |
| primary | 1335 | 1.0% |
| zahanati | 1231 | 0.9% |
| msingi | 1102 | 0.8% |
| Other values (34870) | 89851 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 123688 | |
| i | 65417 | 8.0% |
| 62473 | 7.7% | |
| n | 52634 | 6.5% |
| e | 51422 | 6.3% |
| w | 39672 | 4.9% |
| K | 39197 | 4.8% |
| o | 37761 | 4.6% |
| u | 30433 | 3.7% |
| M | 27612 | 3.4% |
| Other values (66) | 284724 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 815033 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 123688 | |
| i | 65417 | 8.0% |
| 62473 | 7.7% | |
| n | 52634 | 6.5% |
| e | 51422 | 6.3% |
| w | 39672 | 4.9% |
| K | 39197 | 4.8% |
| o | 37761 | 4.6% |
| u | 30433 | 3.7% |
| M | 27612 | 3.4% |
| Other values (66) | 284724 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 815033 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 123688 | |
| i | 65417 | 8.0% |
| 62473 | 7.7% | |
| n | 52634 | 6.5% |
| e | 51422 | 6.3% |
| w | 39672 | 4.9% |
| K | 39197 | 4.8% |
| o | 37761 | 4.6% |
| u | 30433 | 3.7% |
| M | 27612 | 3.4% |
| Other values (66) | 284724 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 815033 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 123688 | |
| i | 65417 | 8.0% |
| 62473 | 7.7% | |
| n | 52634 | 6.5% |
| e | 51422 | 6.3% |
| w | 39672 | 4.9% |
| K | 39197 | 4.8% |
| o | 37761 | 4.6% |
| u | 30433 | 3.7% |
| M | 27612 | 3.4% |
| Other values (66) | 284724 |
num_private
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 68 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.46232997 |
| Minimum | 0 |
|---|---|
| Maximum | 1776 |
| Zeros | 73299 |
| Zeros (%) | 98.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1776 |
| Range | 1776 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 11.537879 |
|---|---|
| Coefficient of variation (CV) | 24.955939 |
| Kurtosis | 11449.87 |
| Mean | 0.46232997 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 91.326983 |
| Sum | 34328 |
| Variance | 133.12264 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 73299 | |
| 1 | 94 | 0.1% |
| 6 | 92 | 0.1% |
| 5 | 60 | 0.1% |
| 8 | 58 | 0.1% |
| 15 | 47 | 0.1% |
| 32 | 45 | 0.1% |
| 45 | 41 | 0.1% |
| 3 | 38 | 0.1% |
| 93 | 37 | < 0.1% |
| Other values (58) | 439 | 0.6% |
| Value | Count | Frequency (%) |
| 0 | 73299 | |
| 1 | 94 | 0.1% |
| 2 | 31 | < 0.1% |
| 3 | 38 | 0.1% |
| 4 | 30 | < 0.1% |
| 5 | 60 | 0.1% |
| 6 | 92 | 0.1% |
| 7 | 31 | < 0.1% |
| 8 | 58 | 0.1% |
| 9 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 1776 | 1 | |
| 1402 | 1 | |
| 755 | 1 | |
| 698 | 1 | |
| 672 | 1 | |
| 669 | 1 | |
| 668 | 1 | |
| 450 | 1 | |
| 420 | 1 | |
| 300 | 1 |
basin
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| Lake Victoria | |
|---|---|
| Pangani | |
| Rufiji | |
| Internal | |
| Lake Tanganyika | |
| Other values (4) |
Length
| Max length | 23 |
|---|---|
| Median length | 11 |
| Mean length | 10.894545 |
| Min length | 6 |
Characters and Unicode
| Total characters | 808920 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Lake Nyasa |
|---|---|
| 2nd row | Lake Victoria |
| 3rd row | Pangani |
| 4th row | Ruvuma / Southern Coast |
| 5th row | Lake Victoria |
Common Values
| Value | Count | Frequency (%) |
| Lake Victoria | 12871 | |
| Pangani | 11143 | |
| Rufiji | 9987 | |
| Internal | 9642 | |
| Lake Tanganyika | 8052 | |
| Wami / Ruvu | 7577 | |
| Lake Nyasa | 6332 | |
| Ruvuma / Southern Coast | 5587 | |
| Lake Rukwa | 3059 | 4.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| lake | 30314 | |
| 13164 | ||
| victoria | 12871 | |
| pangani | 11143 | 8.2% |
| rufiji | 9987 | 7.3% |
| internal | 9642 | 7.1% |
| tanganyika | 8052 | 5.9% |
| wami | 7577 | 5.6% |
| ruvu | 7577 | 5.6% |
| nyasa | 6332 | 4.6% |
| Other values (4) | 19820 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 133743 | |
| i | 72488 | 9.0% |
| n | 63261 | 7.8% |
| 62229 | 7.7% | |
| e | 45543 | 5.6% |
| u | 44961 | 5.6% |
| k | 41425 | 5.1% |
| t | 33687 | 4.2% |
| L | 30314 | 3.7% |
| r | 28100 | 3.5% |
| Other values (22) | 253169 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 808920 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 133743 | |
| i | 72488 | 9.0% |
| n | 63261 | 7.8% |
| 62229 | 7.7% | |
| e | 45543 | 5.6% |
| u | 44961 | 5.6% |
| k | 41425 | 5.1% |
| t | 33687 | 4.2% |
| L | 30314 | 3.7% |
| r | 28100 | 3.5% |
| Other values (22) | 253169 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 808920 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 133743 | |
| i | 72488 | 9.0% |
| n | 63261 | 7.8% |
| 62229 | 7.7% | |
| e | 45543 | 5.6% |
| u | 44961 | 5.6% |
| k | 41425 | 5.1% |
| t | 33687 | 4.2% |
| L | 30314 | 3.7% |
| r | 28100 | 3.5% |
| Other values (22) | 253169 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 808920 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 133743 | |
| i | 72488 | 9.0% |
| n | 63261 | 7.8% |
| 62229 | 7.7% | |
| e | 45543 | 5.6% |
| u | 44961 | 5.6% |
| k | 41425 | 5.1% |
| t | 33687 | 4.2% |
| L | 30314 | 3.7% |
| r | 28100 | 3.5% |
| Other values (22) | 253169 |
subvillage
Text
| Distinct | 21425 |
|---|---|
| Distinct (%) | 29.0% |
| Missing | 470 |
| Missing (%) | 0.6% |
| Memory size | 1.1 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 27 |
| Mean length | 7.898997 |
| Min length | 1 |
Characters and Unicode
| Total characters | 582788 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 9752 ? |
|---|---|
| Unique (%) | 13.2% |
Sample
| 1st row | Mnyusi B |
|---|---|
| 2nd row | Nyamara |
| 3rd row | Majengo |
| 4th row | Mahakamani |
| 5th row | Kyanyamisa |
| Value | Count | Frequency (%) |
| a | 3016 | 3.4% |
| b | 2524 | 2.9% |
| kati | 2351 | 2.7% |
| majengo | 768 | 0.9% |
| wa | 762 | 0.9% |
| shuleni | 754 | 0.9% |
| madukani | 709 | 0.8% |
| mtaa | 656 | 0.7% |
| juu | 504 | 0.6% |
| mjini | 458 | 0.5% |
| Other values (18756) | 76014 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 90081 | |
| i | 56910 | 9.8% |
| n | 41876 | 7.2% |
| u | 32997 | 5.7% |
| e | 32135 | 5.5% |
| o | 29502 | 5.1% |
| M | 25477 | 4.4% |
| g | 23754 | 4.1% |
| l | 20522 | 3.5% |
| m | 18839 | 3.2% |
| Other values (63) | 210695 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 582788 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 90081 | |
| i | 56910 | 9.8% |
| n | 41876 | 7.2% |
| u | 32997 | 5.7% |
| e | 32135 | 5.5% |
| o | 29502 | 5.1% |
| M | 25477 | 4.4% |
| g | 23754 | 4.1% |
| l | 20522 | 3.5% |
| m | 18839 | 3.2% |
| Other values (63) | 210695 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 582788 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 90081 | |
| i | 56910 | 9.8% |
| n | 41876 | 7.2% |
| u | 32997 | 5.7% |
| e | 32135 | 5.5% |
| o | 29502 | 5.1% |
| M | 25477 | 4.4% |
| g | 23754 | 4.1% |
| l | 20522 | 3.5% |
| m | 18839 | 3.2% |
| Other values (63) | 210695 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 582788 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 90081 | |
| i | 56910 | 9.8% |
| n | 41876 | 7.2% |
| u | 32997 | 5.7% |
| e | 32135 | 5.5% |
| o | 29502 | 5.1% |
| M | 25477 | 4.4% |
| g | 23754 | 4.1% |
| l | 20522 | 3.5% |
| m | 18839 | 3.2% |
| Other values (63) | 210695 |
region
Categorical
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| Iringa | |
|---|---|
| Shinyanga | |
| Mbeya | |
| Kilimanjaro | |
| Morogoro | |
| Other values (16) |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 6.6294141 |
| Min length | 4 |
Characters and Unicode
| Total characters | 492234 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Iringa |
|---|---|
| 2nd row | Mara |
| 3rd row | Manyara |
| 4th row | Mtwara |
| 5th row | Kagera |
Common Values
| Value | Count | Frequency (%) |
| Iringa | 6599 | 8.9% |
| Shinyanga | 6293 | 8.5% |
| Mbeya | 5758 | 7.8% |
| Kilimanjaro | 5494 | 7.4% |
| Morogoro | 5038 | 6.8% |
| Kagera | 4174 | 5.6% |
| Arusha | 4111 | 5.5% |
| Mwanza | 3897 | 5.2% |
| Kigoma | 3533 | 4.8% |
| Pwani | 3331 | 4.5% |
| Other values (11) | 26022 |
Length
| Value | Count | Frequency (%) |
| iringa | 6599 | 8.6% |
| shinyanga | 6293 | 8.2% |
| mbeya | 5758 | 7.5% |
| kilimanjaro | 5494 | 7.2% |
| morogoro | 5038 | 6.6% |
| kagera | 4174 | 5.5% |
| arusha | 4111 | 5.4% |
| mwanza | 3897 | 5.1% |
| kigoma | 3533 | 4.6% |
| pwani | 3331 | 4.4% |
| Other values (13) | 28062 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 104401 | |
| n | 41521 | 8.4% |
| r | 40507 | 8.2% |
| i | 39656 | 8.1% |
| o | 37203 | 7.6% |
| g | 31359 | 6.4% |
| M | 21260 | 4.3% |
| m | 16132 | 3.3% |
| y | 14023 | 2.8% |
| K | 13201 | 2.7% |
| Other values (22) | 132971 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 492234 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 104401 | |
| n | 41521 | 8.4% |
| r | 40507 | 8.2% |
| i | 39656 | 8.1% |
| o | 37203 | 7.6% |
| g | 31359 | 6.4% |
| M | 21260 | 4.3% |
| m | 16132 | 3.3% |
| y | 14023 | 2.8% |
| K | 13201 | 2.7% |
| Other values (22) | 132971 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 492234 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 104401 | |
| n | 41521 | 8.4% |
| r | 40507 | 8.2% |
| i | 39656 | 8.1% |
| o | 37203 | 7.6% |
| g | 31359 | 6.4% |
| M | 21260 | 4.3% |
| m | 16132 | 3.3% |
| y | 14023 | 2.8% |
| K | 13201 | 2.7% |
| Other values (22) | 132971 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 492234 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 104401 | |
| n | 41521 | 8.4% |
| r | 40507 | 8.2% |
| i | 39656 | 8.1% |
| o | 37203 | 7.6% |
| g | 31359 | 6.4% |
| M | 21260 | 4.3% |
| m | 16132 | 3.3% |
| y | 14023 | 2.8% |
| K | 13201 | 2.7% |
| Other values (22) | 132971 |
region_code
Real number (ℝ)
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.265414 |
| Minimum | 1 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 5 |
| median | 12 |
| Q3 | 17 |
| 95-th percentile | 60 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 17.508907 |
|---|---|
| Coefficient of variation (CV) | 1.1469657 |
| Kurtosis | 10.354697 |
| Mean | 15.265414 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 3.1794543 |
| Sum | 1133457 |
| Variance | 306.56182 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11 | 6608 | 8.9% |
| 17 | 6334 | 8.5% |
| 12 | 5759 | 7.8% |
| 3 | 5494 | 7.4% |
| 5 | 5079 | 6.8% |
| 18 | 4183 | 5.6% |
| 19 | 3824 | 5.2% |
| 2 | 3709 | 5.0% |
| 16 | 3533 | 4.8% |
| 10 | 3306 | 4.5% |
| Other values (17) | 26421 |
| Value | Count | Frequency (%) |
| 1 | 2779 | |
| 2 | 3709 | |
| 3 | 5494 | |
| 4 | 3145 | |
| 5 | 5079 | |
| 6 | 2032 | 2.7% |
| 7 | 1020 | 1.4% |
| 8 | 375 | 0.5% |
| 9 | 499 | 0.7% |
| 10 | 3306 |
| Value | Count | Frequency (%) |
| 99 | 512 | 0.7% |
| 90 | 1133 | 1.5% |
| 80 | 1536 | 2.1% |
| 60 | 1298 | 1.7% |
| 40 | 1 | < 0.1% |
| 24 | 402 | 0.5% |
| 21 | 1972 | |
| 20 | 2451 | |
| 19 | 3824 | |
| 18 | 4183 |
district_code
Real number (ℝ)
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.6290774 |
| Minimum | 0 |
|---|---|
| Maximum | 80 |
| Zeros | 27 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 30 |
| Maximum | 80 |
| Range | 80 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 9.6416356 |
|---|---|
| Coefficient of variation (CV) | 1.712827 |
| Kurtosis | 16.191722 |
| Mean | 5.6290774 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.9614329 |
| Sum | 417959 |
| Variance | 92.961136 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 15299 | |
| 2 | 13929 | |
| 3 | 12521 | |
| 4 | 11253 | |
| 5 | 5428 | 7.3% |
| 6 | 5108 | 6.9% |
| 7 | 4166 | 5.6% |
| 8 | 1282 | 1.7% |
| 30 | 1256 | 1.7% |
| 33 | 1063 | 1.4% |
| Other values (10) | 2945 | 4.0% |
| Value | Count | Frequency (%) |
| 0 | 27 | < 0.1% |
| 1 | 15299 | |
| 2 | 13929 | |
| 3 | 12521 | |
| 4 | 11253 | |
| 5 | 5428 | 7.3% |
| 6 | 5108 | 6.9% |
| 7 | 4166 | 5.6% |
| 8 | 1282 | 1.7% |
| 13 | 496 | 0.7% |
| Value | Count | Frequency (%) |
| 80 | 13 | < 0.1% |
| 67 | 8 | < 0.1% |
| 63 | 264 | 0.4% |
| 62 | 127 | 0.2% |
| 60 | 76 | 0.1% |
| 53 | 921 | |
| 43 | 653 | |
| 33 | 1063 | |
| 30 | 1256 | |
| 23 | 360 | 0.5% |
lga
Text
| Distinct | 125 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 14 |
| Mean length | 7.4073805 |
| Min length | 3 |
Characters and Unicode
| Total characters | 549998 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ludewa |
|---|---|
| 2nd row | Serengeti |
| 3rd row | Simanjiro |
| 4th row | Nanyumbu |
| 5th row | Karagwe |
| Value | Count | Frequency (%) |
| rural | 11814 | 13.4% |
| njombe | 3128 | 3.5% |
| urban | 2118 | 2.4% |
| moshi | 1669 | 1.9% |
| arusha | 1603 | 1.8% |
| bariadi | 1485 | 1.7% |
| singida | 1410 | 1.6% |
| rungwe | 1381 | 1.6% |
| kilosa | 1368 | 1.6% |
| kasulu | 1322 | 1.5% |
| Other values (106) | 60884 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 87352 | |
| o | 37693 | 6.9% |
| i | 36767 | 6.7% |
| u | 35252 | 6.4% |
| r | 33487 | 6.1% |
| e | 28292 | 5.1% |
| n | 28081 | 5.1% |
| l | 23976 | 4.4% |
| g | 22965 | 4.2% |
| M | 19956 | 3.6% |
| Other values (31) | 196177 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 549998 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 87352 | |
| o | 37693 | 6.9% |
| i | 36767 | 6.7% |
| u | 35252 | 6.4% |
| r | 33487 | 6.1% |
| e | 28292 | 5.1% |
| n | 28081 | 5.1% |
| l | 23976 | 4.4% |
| g | 22965 | 4.2% |
| M | 19956 | 3.6% |
| Other values (31) | 196177 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 549998 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 87352 | |
| o | 37693 | 6.9% |
| i | 36767 | 6.7% |
| u | 35252 | 6.4% |
| r | 33487 | 6.1% |
| e | 28292 | 5.1% |
| n | 28081 | 5.1% |
| l | 23976 | 4.4% |
| g | 22965 | 4.2% |
| M | 19956 | 3.6% |
| Other values (31) | 196177 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 549998 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 87352 | |
| o | 37693 | 6.9% |
| i | 36767 | 6.7% |
| u | 35252 | 6.4% |
| r | 33487 | 6.1% |
| e | 28292 | 5.1% |
| n | 28081 | 5.1% |
| l | 23976 | 4.4% |
| g | 22965 | 4.2% |
| M | 19956 | 3.6% |
| Other values (31) | 196177 |
ward
Text
| Distinct | 2098 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 19 |
| Mean length | 7.5064242 |
| Min length | 3 |
Characters and Unicode
| Total characters | 557352 |
|---|---|
| Distinct characters | 54 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 21 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Mundindi |
|---|---|
| 2nd row | Natta |
| 3rd row | Ngorika |
| 4th row | Nanyumbu |
| 5th row | Nyakasimbi |
| Value | Count | Frequency (%) |
| mashariki | 720 | 0.9% |
| urban | 666 | 0.8% |
| siha | 550 | 0.7% |
| kusini | 488 | 0.6% |
| magharibi | 472 | 0.6% |
| igosi | 386 | 0.5% |
| masama | 382 | 0.5% |
| machame | 363 | 0.4% |
| kati | 342 | 0.4% |
| imalinyi | 318 | 0.4% |
| Other values (2112) | 76252 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 86986 | |
| i | 50510 | 9.1% |
| n | 36882 | 6.6% |
| u | 33914 | 6.1% |
| o | 32443 | 5.8% |
| e | 29383 | 5.3% |
| g | 26356 | 4.7% |
| M | 23580 | 4.2% |
| m | 20301 | 3.6% |
| l | 19770 | 3.5% |
| Other values (44) | 197227 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 557352 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 86986 | |
| i | 50510 | 9.1% |
| n | 36882 | 6.6% |
| u | 33914 | 6.1% |
| o | 32443 | 5.8% |
| e | 29383 | 5.3% |
| g | 26356 | 4.7% |
| M | 23580 | 4.2% |
| m | 20301 | 3.6% |
| l | 19770 | 3.5% |
| Other values (44) | 197227 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 557352 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 86986 | |
| i | 50510 | 9.1% |
| n | 36882 | 6.6% |
| u | 33914 | 6.1% |
| o | 32443 | 5.8% |
| e | 29383 | 5.3% |
| g | 26356 | 4.7% |
| M | 23580 | 4.2% |
| m | 20301 | 3.6% |
| l | 19770 | 3.5% |
| Other values (44) | 197227 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 557352 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 86986 | |
| i | 50510 | 9.1% |
| n | 36882 | 6.6% |
| u | 33914 | 6.1% |
| o | 32443 | 5.8% |
| e | 29383 | 5.3% |
| g | 26356 | 4.7% |
| M | 23580 | 4.2% |
| m | 20301 | 3.6% |
| l | 19770 | 3.5% |
| Other values (44) | 197227 |
population
Real number (ℝ)
ZEROS 
| Distinct | 1128 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 180.75083 |
| Minimum | 0 |
|---|---|
| Maximum | 30500 |
| Zeros | 26834 |
| Zeros (%) | 36.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 25 |
| Q3 | 215 |
| 95-th percentile | 690 |
| Maximum | 30500 |
| Range | 30500 |
| Interquartile range (IQR) | 215 |
Descriptive statistics
| Standard deviation | 471.08612 |
|---|---|
| Coefficient of variation (CV) | 2.6062736 |
| Kurtosis | 343.36556 |
| Mean | 180.75083 |
| Median Absolute Deviation (MAD) | 25 |
| Skewness | 11.780615 |
| Sum | 13420749 |
| Variance | 221922.13 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 26834 | |
| 1 | 8782 | 11.8% |
| 200 | 2370 | 3.2% |
| 150 | 2328 | 3.1% |
| 250 | 2087 | 2.8% |
| 300 | 1842 | 2.5% |
| 50 | 1437 | 1.9% |
| 100 | 1419 | 1.9% |
| 500 | 1274 | 1.7% |
| 350 | 1252 | 1.7% |
| Other values (1118) | 24625 |
| Value | Count | Frequency (%) |
| 0 | 26834 | |
| 1 | 8782 | 11.8% |
| 2 | 9 | < 0.1% |
| 3 | 6 | < 0.1% |
| 4 | 15 | < 0.1% |
| 5 | 50 | 0.1% |
| 6 | 27 | < 0.1% |
| 7 | 3 | < 0.1% |
| 8 | 29 | < 0.1% |
| 9 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 30500 | 1 | < 0.1% |
| 15300 | 1 | < 0.1% |
| 11469 | 1 | < 0.1% |
| 11463 | 1 | < 0.1% |
| 10000 | 3 | |
| 9865 | 1 | < 0.1% |
| 9800 | 1 | < 0.1% |
| 9500 | 1 | < 0.1% |
| 9000 | 4 | |
| 8848 | 1 | < 0.1% |
public_meeting
Boolean
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4155 |
| Missing (%) | 5.6% |
| Memory size | 1.1 MiB |
| True | |
|---|---|
| False | 6346 |
| (Missing) | 4155 |
| Value | Count | Frequency (%) |
| True | 63749 | |
| False | 6346 | 8.5% |
| (Missing) | 4155 | 5.6% |
recorded_by
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| GeoData Consultants Ltd |
|---|
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 23 |
| Min length | 23 |
Characters and Unicode
| Total characters | 1707750 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | GeoData Consultants Ltd |
|---|---|
| 2nd row | GeoData Consultants Ltd |
| 3rd row | GeoData Consultants Ltd |
| 4th row | GeoData Consultants Ltd |
| 5th row | GeoData Consultants Ltd |
Common Values
| Value | Count | Frequency (%) |
| GeoData Consultants Ltd | 74250 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| geodata | 74250 | |
| consultants | 74250 | |
| ltd | 74250 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 297000 | |
| a | 222750 | |
| o | 148500 | |
| 148500 | ||
| n | 148500 | |
| s | 148500 | |
| G | 74250 | 4.3% |
| e | 74250 | 4.3% |
| D | 74250 | 4.3% |
| C | 74250 | 4.3% |
| Other values (4) | 297000 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1707750 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 297000 | |
| a | 222750 | |
| o | 148500 | |
| 148500 | ||
| n | 148500 | |
| s | 148500 | |
| G | 74250 | 4.3% |
| e | 74250 | 4.3% |
| D | 74250 | 4.3% |
| C | 74250 | 4.3% |
| Other values (4) | 297000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1707750 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 297000 | |
| a | 222750 | |
| o | 148500 | |
| 148500 | ||
| n | 148500 | |
| s | 148500 | |
| G | 74250 | 4.3% |
| e | 74250 | 4.3% |
| D | 74250 | 4.3% |
| C | 74250 | 4.3% |
| Other values (4) | 297000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1707750 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 297000 | |
| a | 222750 | |
| o | 148500 | |
| 148500 | ||
| n | 148500 | |
| s | 148500 | |
| G | 74250 | 4.3% |
| e | 74250 | 4.3% |
| D | 74250 | 4.3% |
| C | 74250 | 4.3% |
| Other values (4) | 297000 |
scheme_management
Categorical
MISSING 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4847 |
| Missing (%) | 6.5% |
| Memory size | 1.1 MiB |
| VWC | |
|---|---|
| WUG | |
| Water authority | 3975 |
| WUA | 3551 |
| Water Board | 3462 |
| Other values (6) |
Length
| Max length | 16 |
|---|---|
| Median length | 3 |
| Mean length | 4.6575941 |
| Min length | 3 |
Characters and Unicode
| Total characters | 323251 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | VWC |
|---|---|
| 2nd row | Other |
| 3rd row | VWC |
| 4th row | VWC |
| 5th row | VWC |
Common Values
| Value | Count | Frequency (%) |
| VWC | 45917 | |
| WUG | 6496 | 8.7% |
| Water authority | 3975 | 5.4% |
| WUA | 3551 | 4.8% |
| Water Board | 3462 | 4.7% |
| Parastatal | 2124 | 2.9% |
| Company | 1341 | 1.8% |
| Private operator | 1326 | 1.8% |
| Other | 996 | 1.3% |
| SWC | 123 | 0.2% |
| (Missing) | 4847 | 6.5% |
Length
| Value | Count | Frequency (%) |
| vwc | 45917 | |
| water | 7437 | 9.5% |
| wug | 6496 | 8.3% |
| authority | 3975 | 5.1% |
| wua | 3551 | 4.5% |
| board | 3462 | 4.4% |
| parastatal | 2124 | 2.7% |
| company | 1341 | 1.7% |
| private | 1326 | 1.7% |
| operator | 1326 | 1.7% |
| Other values (3) | 1211 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| W | 63524 | |
| C | 47381 | |
| V | 45917 | |
| a | 27363 | |
| t | 23375 | 7.2% |
| r | 22064 | 6.8% |
| o | 11430 | 3.5% |
| e | 11085 | 3.4% |
| U | 10047 | 3.1% |
| 8763 | 2.7% | |
| Other values (18) | 52302 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 323251 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| W | 63524 | |
| C | 47381 | |
| V | 45917 | |
| a | 27363 | |
| t | 23375 | 7.2% |
| r | 22064 | 6.8% |
| o | 11430 | 3.5% |
| e | 11085 | 3.4% |
| U | 10047 | 3.1% |
| 8763 | 2.7% | |
| Other values (18) | 52302 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 323251 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| W | 63524 | |
| C | 47381 | |
| V | 45917 | |
| a | 27363 | |
| t | 23375 | 7.2% |
| r | 22064 | 6.8% |
| o | 11430 | 3.5% |
| e | 11085 | 3.4% |
| U | 10047 | 3.1% |
| 8763 | 2.7% | |
| Other values (18) | 52302 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 323251 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| W | 63524 | |
| C | 47381 | |
| V | 45917 | |
| a | 27363 | |
| t | 23375 | 7.2% |
| r | 22064 | 6.8% |
| o | 11430 | 3.5% |
| e | 11085 | 3.4% |
| U | 10047 | 3.1% |
| 8763 | 2.7% | |
| Other values (18) | 52302 |
scheme_name
Text
MISSING 
| Distinct | 2867 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 36052 |
| Missing (%) | 48.6% |
| Memory size | 1.1 MiB |
Length
| Max length | 46 |
|---|---|
| Median length | 37 |
| Mean length | 14.487539 |
| Min length | 1 |
Characters and Unicode
| Total characters | 553395 |
|---|---|
| Distinct characters | 68 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 752 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | Roman |
|---|---|
| 2nd row | Nyumba ya mungu pipe scheme |
| 3rd row | Zingibali |
| 4th row | BL Bondeni |
| 5th row | wanging'ombe water supply s |
| Value | Count | Frequency (%) |
| water | 12153 | 13.7% |
| supply | 8382 | 9.4% |
| scheme | 3152 | 3.5% |
| wa | 2693 | 3.0% |
| gravity | 2356 | 2.7% |
| maji | 1668 | 1.9% |
| pipe | 1640 | 1.8% |
| mradi | 1371 | 1.5% |
| line | 1225 | 1.4% |
| supplied | 1091 | 1.2% |
| Other values (2623) | 53165 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 60494 | 10.9% |
| 51290 | 9.3% | |
| e | 43069 | 7.8% |
| i | 32776 | 5.9% |
| p | 27880 | 5.0% |
| r | 27119 | 4.9% |
| t | 23804 | 4.3% |
| u | 23019 | 4.2% |
| l | 21577 | 3.9% |
| n | 21341 | 3.9% |
| Other values (58) | 221026 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 553395 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 60494 | 10.9% |
| 51290 | 9.3% | |
| e | 43069 | 7.8% |
| i | 32776 | 5.9% |
| p | 27880 | 5.0% |
| r | 27119 | 4.9% |
| t | 23804 | 4.3% |
| u | 23019 | 4.2% |
| l | 21577 | 3.9% |
| n | 21341 | 3.9% |
| Other values (58) | 221026 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 553395 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 60494 | 10.9% |
| 51290 | 9.3% | |
| e | 43069 | 7.8% |
| i | 32776 | 5.9% |
| p | 27880 | 5.0% |
| r | 27119 | 4.9% |
| t | 23804 | 4.3% |
| u | 23019 | 4.2% |
| l | 21577 | 3.9% |
| n | 21341 | 3.9% |
| Other values (58) | 221026 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 553395 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 60494 | 10.9% |
| 51290 | 9.3% | |
| e | 43069 | 7.8% |
| i | 32776 | 5.9% |
| p | 27880 | 5.0% |
| r | 27119 | 4.9% |
| t | 23804 | 4.3% |
| u | 23019 | 4.2% |
| l | 21577 | 3.9% |
| n | 21341 | 3.9% |
| Other values (58) | 221026 |
permit
Boolean
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3793 |
| Missing (%) | 5.1% |
| Memory size | 1.1 MiB |
| True | |
|---|---|
| False | |
| (Missing) | 3793 |
| Value | Count | Frequency (%) |
| True | 48606 | |
| False | 21851 | |
| (Missing) | 3793 | 5.1% |
construction_year
Real number (ℝ)
ZEROS 
| Distinct | 55 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1298.4636 |
| Minimum | 0 |
|---|---|
| Maximum | 2013 |
| Zeros | 25969 |
| Zeros (%) | 35.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1986 |
| Q3 | 2004 |
| 95-th percentile | 2010 |
| Maximum | 2013 |
| Range | 2013 |
| Interquartile range (IQR) | 2004 |
Descriptive statistics
| Standard deviation | 952.34938 |
|---|---|
| Coefficient of variation (CV) | 0.73344323 |
| Kurtosis | -1.6029319 |
| Mean | 1298.4636 |
| Median Absolute Deviation (MAD) | 22 |
| Skewness | -0.62978407 |
| Sum | 96410926 |
| Variance | 906969.33 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 25969 | |
| 2010 | 3314 | 4.5% |
| 2008 | 3243 | 4.4% |
| 2009 | 3196 | 4.3% |
| 2000 | 2578 | 3.5% |
| 2007 | 1960 | 2.6% |
| 2006 | 1892 | 2.5% |
| 2011 | 1591 | 2.1% |
| 2003 | 1579 | 2.1% |
| 2004 | 1417 | 1.9% |
| Other values (45) | 27511 |
| Value | Count | Frequency (%) |
| 0 | 25969 | |
| 1960 | 124 | 0.2% |
| 1961 | 28 | < 0.1% |
| 1962 | 36 | < 0.1% |
| 1963 | 107 | 0.1% |
| 1964 | 48 | 0.1% |
| 1965 | 21 | < 0.1% |
| 1966 | 19 | < 0.1% |
| 1967 | 106 | 0.1% |
| 1968 | 93 | 0.1% |
| Value | Count | Frequency (%) |
| 2013 | 209 | 0.3% |
| 2012 | 1347 | |
| 2011 | 1591 | |
| 2010 | 3314 | |
| 2009 | 3196 | |
| 2008 | 3243 | |
| 2007 | 1960 | |
| 2006 | 1892 | |
| 2005 | 1275 | 1.7% |
| 2004 | 1417 |
extraction_type
Categorical
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| gravity | |
|---|---|
| nira/tanira | |
| other | |
| submersible | |
| swn 80 | |
| Other values (13) |
Length
| Max length | 25 |
|---|---|
| Median length | 17 |
| Mean length | 7.7207003 |
| Min length | 3 |
Characters and Unicode
| Total characters | 573262 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | gravity |
|---|---|
| 2nd row | gravity |
| 3rd row | gravity |
| 4th row | submersible |
| 5th row | gravity |
Common Values
| Value | Count | Frequency (%) |
| gravity | 33263 | |
| nira/tanira | 10205 | 13.7% |
| other | 8102 | 10.9% |
| submersible | 5982 | 8.1% |
| swn 80 | 4588 | 6.2% |
| mono | 3628 | 4.9% |
| india mark ii | 3029 | 4.1% |
| afridev | 2208 | 3.0% |
| ksb | 1790 | 2.4% |
| other - rope pump | 572 | 0.8% |
| Other values (8) | 883 | 1.2% |
Length
| Value | Count | Frequency (%) |
| gravity | 33263 | |
| nira/tanira | 10205 | 11.6% |
| other | 9061 | 10.3% |
| submersible | 5982 | 6.8% |
| swn | 4872 | 5.5% |
| 80 | 4588 | 5.2% |
| mono | 3628 | 4.1% |
| india | 3164 | 3.6% |
| mark | 3164 | 3.6% |
| ii | 3029 | 3.4% |
| Other values (13) | 7085 | 8.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 75123 | |
| r | 74660 | |
| a | 72622 | |
| t | 52529 | |
| v | 35471 | 6.2% |
| y | 33366 | 5.8% |
| g | 33265 | 5.8% |
| n | 32230 | 5.6% |
| e | 23913 | 4.2% |
| s | 18628 | 3.2% |
| Other values (19) | 121455 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 573262 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 75123 | |
| r | 74660 | |
| a | 72622 | |
| t | 52529 | |
| v | 35471 | 6.2% |
| y | 33366 | 5.8% |
| g | 33265 | 5.8% |
| n | 32230 | 5.6% |
| e | 23913 | 4.2% |
| s | 18628 | 3.2% |
| Other values (19) | 121455 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 573262 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 75123 | |
| r | 74660 | |
| a | 72622 | |
| t | 52529 | |
| v | 35471 | 6.2% |
| y | 33366 | 5.8% |
| g | 33265 | 5.8% |
| n | 32230 | 5.6% |
| e | 23913 | 4.2% |
| s | 18628 | 3.2% |
| Other values (19) | 121455 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 573262 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 75123 | |
| r | 74660 | |
| a | 72622 | |
| t | 52529 | |
| v | 35471 | 6.2% |
| y | 33366 | 5.8% |
| g | 33265 | 5.8% |
| n | 32230 | 5.6% |
| e | 23913 | 4.2% |
| s | 18628 | 3.2% |
| Other values (19) | 121455 |
extraction_type_group
Categorical
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| gravity | |
|---|---|
| nira/tanira | |
| other | |
| submersible | |
| swn 80 | |
| Other values (8) |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 7.8831785 |
| Min length | 4 |
Characters and Unicode
| Total characters | 585326 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | gravity |
|---|---|
| 2nd row | gravity |
| 3rd row | gravity |
| 4th row | submersible |
| 5th row | gravity |
Common Values
| Value | Count | Frequency (%) |
| gravity | 33263 | |
| nira/tanira | 10205 | 13.7% |
| other | 8102 | 10.9% |
| submersible | 7772 | 10.5% |
| swn 80 | 4588 | 6.2% |
| mono | 3628 | 4.9% |
| india mark ii | 3029 | 4.1% |
| afridev | 2208 | 3.0% |
| rope pump | 572 | 0.8% |
| other handpump | 447 | 0.6% |
| Other values (3) | 436 | 0.6% |
Length
| Value | Count | Frequency (%) |
| gravity | 33263 | |
| nira/tanira | 10205 | 11.8% |
| other | 8698 | 10.1% |
| submersible | 7772 | 9.0% |
| swn | 4588 | 5.3% |
| 80 | 4588 | 5.3% |
| mono | 3628 | 4.2% |
| mark | 3164 | 3.7% |
| india | 3164 | 3.7% |
| ii | 3029 | 3.5% |
| Other values (7) | 4235 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 76596 | |
| r | 76388 | |
| a | 72861 | |
| t | 52315 | |
| v | 35471 | 6.1% |
| g | 33263 | 5.7% |
| y | 33263 | 5.7% |
| n | 32389 | 5.5% |
| e | 27326 | 4.7% |
| s | 20132 | 3.4% |
| Other values (16) | 125322 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 585326 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 76596 | |
| r | 76388 | |
| a | 72861 | |
| t | 52315 | |
| v | 35471 | 6.1% |
| g | 33263 | 5.7% |
| y | 33263 | 5.7% |
| n | 32389 | 5.5% |
| e | 27326 | 4.7% |
| s | 20132 | 3.4% |
| Other values (16) | 125322 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 585326 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 76596 | |
| r | 76388 | |
| a | 72861 | |
| t | 52315 | |
| v | 35471 | 6.1% |
| g | 33263 | 5.7% |
| y | 33263 | 5.7% |
| n | 32389 | 5.5% |
| e | 27326 | 4.7% |
| s | 20132 | 3.4% |
| Other values (16) | 125322 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 585326 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 76596 | |
| r | 76388 | |
| a | 72861 | |
| t | 52315 | |
| v | 35471 | 6.1% |
| g | 33263 | 5.7% |
| y | 33263 | 5.7% |
| n | 32389 | 5.5% |
| e | 27326 | 4.7% |
| s | 20132 | 3.4% |
| Other values (16) | 125322 |
extraction_type_class
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| gravity | |
|---|---|
| handpump | |
| other | |
| submersible | |
| motorpump | |
| Other values (2) | 724 |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 7.6054411 |
| Min length | 5 |
Characters and Unicode
| Total characters | 564704 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | gravity |
|---|---|
| 2nd row | gravity |
| 3rd row | gravity |
| 4th row | submersible |
| 5th row | gravity |
Common Values
| Value | Count | Frequency (%) |
| gravity | 33263 | |
| handpump | 20612 | |
| other | 8102 | 10.9% |
| submersible | 7772 | 10.5% |
| motorpump | 3777 | 5.1% |
| rope pump | 572 | 0.8% |
| wind-powered | 152 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| gravity | 33263 | |
| handpump | 20612 | |
| other | 8102 | 10.8% |
| submersible | 7772 | 10.4% |
| motorpump | 3777 | 5.0% |
| rope | 572 | 0.8% |
| pump | 572 | 0.8% |
| wind-powered | 152 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 53875 | 9.5% |
| r | 53638 | 9.5% |
| p | 50646 | 9.0% |
| t | 45142 | 8.0% |
| i | 41187 | 7.3% |
| m | 36510 | 6.5% |
| g | 33263 | 5.9% |
| y | 33263 | 5.9% |
| v | 33263 | 5.9% |
| u | 32733 | 5.8% |
| Other values (11) | 151184 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 564704 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 53875 | 9.5% |
| r | 53638 | 9.5% |
| p | 50646 | 9.0% |
| t | 45142 | 8.0% |
| i | 41187 | 7.3% |
| m | 36510 | 6.5% |
| g | 33263 | 5.9% |
| y | 33263 | 5.9% |
| v | 33263 | 5.9% |
| u | 32733 | 5.8% |
| Other values (11) | 151184 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 564704 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 53875 | 9.5% |
| r | 53638 | 9.5% |
| p | 50646 | 9.0% |
| t | 45142 | 8.0% |
| i | 41187 | 7.3% |
| m | 36510 | 6.5% |
| g | 33263 | 5.9% |
| y | 33263 | 5.9% |
| v | 33263 | 5.9% |
| u | 32733 | 5.8% |
| Other values (11) | 151184 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 564704 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 53875 | 9.5% |
| r | 53638 | 9.5% |
| p | 50646 | 9.0% |
| t | 45142 | 8.0% |
| i | 41187 | 7.3% |
| m | 36510 | 6.5% |
| g | 33263 | 5.9% |
| y | 33263 | 5.9% |
| v | 33263 | 5.9% |
| u | 32733 | 5.8% |
| Other values (11) | 151184 |
management
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| vwc | |
|---|---|
| wug | |
| water board | 3688 |
| wua | 3118 |
| private operator | 2504 |
| Other values (7) |
Length
| Max length | 16 |
|---|---|
| Median length | 3 |
| Mean length | 4.3611448 |
| Min length | 3 |
Characters and Unicode
| Total characters | 323815 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | vwc |
|---|---|
| 2nd row | wug |
| 3rd row | vwc |
| 4th row | vwc |
| 5th row | other |
Common Values
| Value | Count | Frequency (%) |
| vwc | 50624 | |
| wug | 8108 | 10.9% |
| water board | 3688 | 5.0% |
| wua | 3118 | 4.2% |
| private operator | 2504 | 3.4% |
| parastatal | 2229 | 3.0% |
| water authority | 1123 | 1.5% |
| other | 1083 | 1.5% |
| company | 859 | 1.2% |
| unknown | 683 | 0.9% |
| Other values (2) | 231 | 0.3% |
Length
| Value | Count | Frequency (%) |
| vwc | 50624 | |
| wug | 8108 | 9.9% |
| water | 4811 | 5.9% |
| board | 3688 | 4.5% |
| wua | 3118 | 3.8% |
| private | 2504 | 3.1% |
| operator | 2504 | 3.1% |
| parastatal | 2229 | 2.7% |
| other | 1209 | 1.5% |
| authority | 1123 | 1.4% |
| Other values (5) | 1899 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| w | 67344 | |
| v | 53128 | |
| c | 51609 | |
| a | 27523 | |
| r | 20677 | 6.4% |
| t | 17942 | 5.5% |
| u | 13137 | 4.1% |
| o | 12822 | 4.0% |
| e | 11028 | 3.4% |
| g | 8108 | 2.5% |
| Other values (13) | 40497 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 323815 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| w | 67344 | |
| v | 53128 | |
| c | 51609 | |
| a | 27523 | |
| r | 20677 | 6.4% |
| t | 17942 | 5.5% |
| u | 13137 | 4.1% |
| o | 12822 | 4.0% |
| e | 11028 | 3.4% |
| g | 8108 | 2.5% |
| Other values (13) | 40497 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 323815 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| w | 67344 | |
| v | 53128 | |
| c | 51609 | |
| a | 27523 | |
| r | 20677 | 6.4% |
| t | 17942 | 5.5% |
| u | 13137 | 4.1% |
| o | 12822 | 4.0% |
| e | 11028 | 3.4% |
| g | 8108 | 2.5% |
| Other values (13) | 40497 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 323815 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| w | 67344 | |
| v | 53128 | |
| c | 51609 | |
| a | 27523 | |
| r | 20677 | 6.4% |
| t | 17942 | 5.5% |
| u | 13137 | 4.1% |
| o | 12822 | 4.0% |
| e | 11028 | 3.4% |
| g | 8108 | 2.5% |
| Other values (13) | 40497 |
management_group
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| user-group | |
|---|---|
| commercial | 4591 |
| parastatal | 2229 |
| other | 1209 |
| unknown | 683 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.8909899 |
| Min length | 5 |
Characters and Unicode
| Total characters | 734406 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | user-group |
|---|---|
| 2nd row | user-group |
| 3rd row | user-group |
| 4th row | user-group |
| 5th row | other |
Common Values
| Value | Count | Frequency (%) |
| user-group | 65538 | |
| commercial | 4591 | 6.2% |
| parastatal | 2229 | 3.0% |
| other | 1209 | 1.6% |
| unknown | 683 | 0.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| user-group | 65538 | |
| commercial | 4591 | 6.2% |
| parastatal | 2229 | 3.0% |
| other | 1209 | 1.6% |
| unknown | 683 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 139105 | |
| u | 131759 | |
| o | 72021 | |
| e | 71338 | |
| s | 67767 | |
| p | 67767 | |
| - | 65538 | |
| g | 65538 | |
| a | 13507 | 1.8% |
| m | 9182 | 1.3% |
| Other values (8) | 30884 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 734406 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 139105 | |
| u | 131759 | |
| o | 72021 | |
| e | 71338 | |
| s | 67767 | |
| p | 67767 | |
| - | 65538 | |
| g | 65538 | |
| a | 13507 | 1.8% |
| m | 9182 | 1.3% |
| Other values (8) | 30884 | 4.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 734406 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 139105 | |
| u | 131759 | |
| o | 72021 | |
| e | 71338 | |
| s | 67767 | |
| p | 67767 | |
| - | 65538 | |
| g | 65538 | |
| a | 13507 | 1.8% |
| m | 9182 | 1.3% |
| Other values (8) | 30884 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 734406 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 139105 | |
| u | 131759 | |
| o | 72021 | |
| e | 71338 | |
| s | 67767 | |
| p | 67767 | |
| - | 65538 | |
| g | 65538 | |
| a | 13507 | 1.8% |
| m | 9182 | 1.3% |
| Other values (8) | 30884 | 4.2% |
payment
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| never pay | |
|---|---|
| pay per bucket | |
| pay monthly | |
| unknown | |
| pay when scheme fails | |
| Other values (2) |
Length
| Max length | 21 |
|---|---|
| Median length | 14 |
| Mean length | 10.661737 |
| Min length | 5 |
Characters and Unicode
| Total characters | 791634 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | pay annually |
|---|---|
| 2nd row | never pay |
| 3rd row | pay per bucket |
| 4th row | never pay |
| 5th row | never pay |
Common Values
| Value | Count | Frequency (%) |
| never pay | 31712 | |
| pay per bucket | 11266 | 15.2% |
| pay monthly | 10397 | 14.0% |
| unknown | 10149 | 13.7% |
| pay when scheme fails | 4842 | 6.5% |
| pay annually | 4570 | 6.2% |
| other | 1314 | 1.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| pay | 62787 | |
| never | 31712 | |
| per | 11266 | 7.1% |
| bucket | 11266 | 7.1% |
| monthly | 10397 | 6.6% |
| unknown | 10149 | 6.4% |
| when | 4842 | 3.1% |
| scheme | 4842 | 3.1% |
| fails | 4842 | 3.1% |
| annually | 4570 | 2.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 101796 | |
| n | 86538 | |
| 83737 | ||
| y | 77754 | |
| a | 76769 | |
| p | 74053 | |
| r | 44292 | 5.6% |
| v | 31712 | 4.0% |
| u | 25985 | 3.3% |
| l | 24379 | 3.1% |
| Other values (11) | 164619 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 791634 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 101796 | |
| n | 86538 | |
| 83737 | ||
| y | 77754 | |
| a | 76769 | |
| p | 74053 | |
| r | 44292 | 5.6% |
| v | 31712 | 4.0% |
| u | 25985 | 3.3% |
| l | 24379 | 3.1% |
| Other values (11) | 164619 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 791634 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 101796 | |
| n | 86538 | |
| 83737 | ||
| y | 77754 | |
| a | 76769 | |
| p | 74053 | |
| r | 44292 | 5.6% |
| v | 31712 | 4.0% |
| u | 25985 | 3.3% |
| l | 24379 | 3.1% |
| Other values (11) | 164619 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 791634 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 101796 | |
| n | 86538 | |
| 83737 | ||
| y | 77754 | |
| a | 76769 | |
| p | 74053 | |
| r | 44292 | 5.6% |
| v | 31712 | 4.0% |
| u | 25985 | 3.3% |
| l | 24379 | 3.1% |
| Other values (11) | 164619 |
payment_type
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| never pay | |
|---|---|
| per bucket | |
| monthly | |
| unknown | |
| on failure | |
| Other values (2) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.5311785 |
| Min length | 5 |
Characters and Unicode
| Total characters | 633440 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | annually |
|---|---|
| 2nd row | never pay |
| 3rd row | per bucket |
| 4th row | never pay |
| 5th row | never pay |
Common Values
| Value | Count | Frequency (%) |
| never pay | 31712 | |
| per bucket | 11266 | 15.2% |
| monthly | 10397 | 14.0% |
| unknown | 10149 | 13.7% |
| on failure | 4842 | 6.5% |
| annually | 4570 | 6.2% |
| other | 1314 | 1.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| never | 31712 | |
| pay | 31712 | |
| per | 11266 | 9.2% |
| bucket | 11266 | 9.2% |
| monthly | 10397 | 8.5% |
| unknown | 10149 | 8.3% |
| on | 4842 | 4.0% |
| failure | 4842 | 4.0% |
| annually | 4570 | 3.7% |
| other | 1314 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 92112 | |
| n | 86538 | |
| r | 49134 | 7.8% |
| 47820 | 7.5% | |
| y | 46679 | 7.4% |
| a | 45694 | 7.2% |
| p | 42978 | 6.8% |
| v | 31712 | 5.0% |
| u | 30827 | 4.9% |
| o | 26702 | 4.2% |
| Other values (10) | 133244 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 633440 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 92112 | |
| n | 86538 | |
| r | 49134 | 7.8% |
| 47820 | 7.5% | |
| y | 46679 | 7.4% |
| a | 45694 | 7.2% |
| p | 42978 | 6.8% |
| v | 31712 | 5.0% |
| u | 30827 | 4.9% |
| o | 26702 | 4.2% |
| Other values (10) | 133244 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 633440 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 92112 | |
| n | 86538 | |
| r | 49134 | 7.8% |
| 47820 | 7.5% | |
| y | 46679 | 7.4% |
| a | 45694 | 7.2% |
| p | 42978 | 6.8% |
| v | 31712 | 5.0% |
| u | 30827 | 4.9% |
| o | 26702 | 4.2% |
| Other values (10) | 133244 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 633440 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 92112 | |
| n | 86538 | |
| r | 49134 | 7.8% |
| 47820 | 7.5% | |
| y | 46679 | 7.4% |
| a | 45694 | 7.2% |
| p | 42978 | 6.8% |
| v | 31712 | 5.0% |
| u | 30827 | 4.9% |
| o | 26702 | 4.2% |
| Other values (10) | 133244 |
water_quality
Categorical
IMBALANCE 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| soft | |
|---|---|
| salty | 6082 |
| unknown | 2345 |
| milky | 1005 |
| coloured | 623 |
| Other values (3) | 690 |
Length
| Max length | 18 |
|---|---|
| Median length | 4 |
| Mean length | 4.3039057 |
| Min length | 4 |
Characters and Unicode
| Total characters | 319565 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | soft |
|---|---|
| 2nd row | soft |
| 3rd row | soft |
| 4th row | soft |
| 5th row | soft |
Common Values
| Value | Count | Frequency (%) |
| soft | 63505 | |
| salty | 6082 | 8.2% |
| unknown | 2345 | 3.2% |
| milky | 1005 | 1.4% |
| coloured | 623 | 0.8% |
| salty abandoned | 423 | 0.6% |
| fluoride | 244 | 0.3% |
| fluoride abandoned | 23 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| soft | 63505 | |
| salty | 6505 | 8.7% |
| unknown | 2345 | 3.1% |
| milky | 1005 | 1.3% |
| coloured | 623 | 0.8% |
| abandoned | 446 | 0.6% |
| fluoride | 267 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 70010 | |
| t | 70010 | |
| o | 67809 | |
| f | 63772 | |
| l | 8400 | 2.6% |
| n | 7927 | 2.5% |
| y | 7510 | 2.4% |
| a | 7397 | 2.3% |
| k | 3350 | 1.0% |
| u | 3235 | 1.0% |
| Other values (9) | 10145 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 319565 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| s | 70010 | |
| t | 70010 | |
| o | 67809 | |
| f | 63772 | |
| l | 8400 | 2.6% |
| n | 7927 | 2.5% |
| y | 7510 | 2.4% |
| a | 7397 | 2.3% |
| k | 3350 | 1.0% |
| u | 3235 | 1.0% |
| Other values (9) | 10145 | 3.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 319565 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| s | 70010 | |
| t | 70010 | |
| o | 67809 | |
| f | 63772 | |
| l | 8400 | 2.6% |
| n | 7927 | 2.5% |
| y | 7510 | 2.4% |
| a | 7397 | 2.3% |
| k | 3350 | 1.0% |
| u | 3235 | 1.0% |
| Other values (9) | 10145 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 319565 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| s | 70010 | |
| t | 70010 | |
| o | 67809 | |
| f | 63772 | |
| l | 8400 | 2.6% |
| n | 7927 | 2.5% |
| y | 7510 | 2.4% |
| a | 7397 | 2.3% |
| k | 3350 | 1.0% |
| u | 3235 | 1.0% |
| Other values (9) | 10145 | 3.2% |
quality_group
Categorical
IMBALANCE 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| good | |
|---|---|
| salty | |
| unknown | 2345 |
| milky | 1005 |
| colored | 623 |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.2354478 |
| Min length | 4 |
Characters and Unicode
| Total characters | 314482 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | good |
|---|---|
| 2nd row | good |
| 3rd row | good |
| 4th row | good |
| 5th row | good |
Common Values
| Value | Count | Frequency (%) |
| good | 63505 | |
| salty | 6505 | 8.8% |
| unknown | 2345 | 3.2% |
| milky | 1005 | 1.4% |
| colored | 623 | 0.8% |
| fluoride | 267 | 0.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| good | 63505 | |
| salty | 6505 | 8.8% |
| unknown | 2345 | 3.2% |
| milky | 1005 | 1.4% |
| colored | 623 | 0.8% |
| fluoride | 267 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 130868 | |
| d | 64395 | |
| g | 63505 | |
| l | 8400 | 2.7% |
| y | 7510 | 2.4% |
| n | 7035 | 2.2% |
| t | 6505 | 2.1% |
| a | 6505 | 2.1% |
| s | 6505 | 2.1% |
| k | 3350 | 1.1% |
| Other values (8) | 9904 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 314482 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 130868 | |
| d | 64395 | |
| g | 63505 | |
| l | 8400 | 2.7% |
| y | 7510 | 2.4% |
| n | 7035 | 2.2% |
| t | 6505 | 2.1% |
| a | 6505 | 2.1% |
| s | 6505 | 2.1% |
| k | 3350 | 1.1% |
| Other values (8) | 9904 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 314482 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 130868 | |
| d | 64395 | |
| g | 63505 | |
| l | 8400 | 2.7% |
| y | 7510 | 2.4% |
| n | 7035 | 2.2% |
| t | 6505 | 2.1% |
| a | 6505 | 2.1% |
| s | 6505 | 2.1% |
| k | 3350 | 1.1% |
| Other values (8) | 9904 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 314482 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 130868 | |
| d | 64395 | |
| g | 63505 | |
| l | 8400 | 2.7% |
| y | 7510 | 2.4% |
| n | 7035 | 2.2% |
| t | 6505 | 2.1% |
| a | 6505 | 2.1% |
| s | 6505 | 2.1% |
| k | 3350 | 1.1% |
| Other values (8) | 9904 | 3.1% |
quantity
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| enough | |
|---|---|
| insufficient | |
| dry | |
| seasonal | |
| unknown | 975 |
Length
| Max length | 12 |
|---|---|
| Median length | 6 |
| Mean length | 7.3623569 |
| Min length | 3 |
Characters and Unicode
| Total characters | 546655 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | enough |
|---|---|
| 2nd row | insufficient |
| 3rd row | enough |
| 4th row | dry |
| 5th row | seasonal |
Common Values
| Value | Count | Frequency (%) |
| enough | 41522 | |
| insufficient | 18896 | |
| dry | 7782 | 10.5% |
| seasonal | 5075 | 6.8% |
| unknown | 975 | 1.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| enough | 41522 | |
| insufficient | 18896 | |
| dry | 7782 | 10.5% |
| seasonal | 5075 | 6.8% |
| unknown | 975 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 87314 | |
| e | 65493 | |
| u | 61393 | |
| i | 56688 | |
| o | 47572 | |
| g | 41522 | |
| h | 41522 | |
| f | 37792 | |
| s | 29046 | 5.3% |
| t | 18896 | 3.5% |
| Other values (8) | 59417 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 546655 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 87314 | |
| e | 65493 | |
| u | 61393 | |
| i | 56688 | |
| o | 47572 | |
| g | 41522 | |
| h | 41522 | |
| f | 37792 | |
| s | 29046 | 5.3% |
| t | 18896 | 3.5% |
| Other values (8) | 59417 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 546655 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 87314 | |
| e | 65493 | |
| u | 61393 | |
| i | 56688 | |
| o | 47572 | |
| g | 41522 | |
| h | 41522 | |
| f | 37792 | |
| s | 29046 | 5.3% |
| t | 18896 | 3.5% |
| Other values (8) | 59417 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 546655 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 87314 | |
| e | 65493 | |
| u | 61393 | |
| i | 56688 | |
| o | 47572 | |
| g | 41522 | |
| h | 41522 | |
| f | 37792 | |
| s | 29046 | 5.3% |
| t | 18896 | 3.5% |
| Other values (8) | 59417 |
quantity_group
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| enough | |
|---|---|
| insufficient | |
| dry | |
| seasonal | |
| unknown | 975 |
Length
| Max length | 12 |
|---|---|
| Median length | 6 |
| Mean length | 7.3623569 |
| Min length | 3 |
Characters and Unicode
| Total characters | 546655 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | enough |
|---|---|
| 2nd row | insufficient |
| 3rd row | enough |
| 4th row | dry |
| 5th row | seasonal |
Common Values
| Value | Count | Frequency (%) |
| enough | 41522 | |
| insufficient | 18896 | |
| dry | 7782 | 10.5% |
| seasonal | 5075 | 6.8% |
| unknown | 975 | 1.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| enough | 41522 | |
| insufficient | 18896 | |
| dry | 7782 | 10.5% |
| seasonal | 5075 | 6.8% |
| unknown | 975 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 87314 | |
| e | 65493 | |
| u | 61393 | |
| i | 56688 | |
| o | 47572 | |
| g | 41522 | |
| h | 41522 | |
| f | 37792 | |
| s | 29046 | 5.3% |
| t | 18896 | 3.5% |
| Other values (8) | 59417 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 546655 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 87314 | |
| e | 65493 | |
| u | 61393 | |
| i | 56688 | |
| o | 47572 | |
| g | 41522 | |
| h | 41522 | |
| f | 37792 | |
| s | 29046 | 5.3% |
| t | 18896 | 3.5% |
| Other values (8) | 59417 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 546655 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 87314 | |
| e | 65493 | |
| u | 61393 | |
| i | 56688 | |
| o | 47572 | |
| g | 41522 | |
| h | 41522 | |
| f | 37792 | |
| s | 29046 | 5.3% |
| t | 18896 | 3.5% |
| Other values (8) | 59417 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 546655 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 87314 | |
| e | 65493 | |
| u | 61393 | |
| i | 56688 | |
| o | 47572 | |
| g | 41522 | |
| h | 41522 | |
| f | 37792 | |
| s | 29046 | 5.3% |
| t | 18896 | 3.5% |
| Other values (8) | 59417 |
source
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| spring | |
|---|---|
| shallow well | |
| machine dbh | |
| river | |
| rainwater harvesting | |
| Other values (5) |
Length
| Max length | 20 |
|---|---|
| Median length | 12 |
| Mean length | 8.9857104 |
| Min length | 3 |
Characters and Unicode
| Total characters | 667189 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | spring |
|---|---|
| 2nd row | rainwater harvesting |
| 3rd row | dam |
| 4th row | machine dbh |
| 5th row | rainwater harvesting |
Common Values
| Value | Count | Frequency (%) |
| spring | 21216 | |
| shallow well | 21140 | |
| machine dbh | 13822 | |
| river | 11964 | |
| rainwater harvesting | 2863 | 3.9% |
| hand dtw | 1108 | 1.5% |
| lake | 950 | 1.3% |
| dam | 840 | 1.1% |
| other | 261 | 0.4% |
| unknown | 86 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| spring | 21216 | |
| shallow | 21140 | |
| well | 21140 | |
| machine | 13822 | |
| dbh | 13822 | |
| river | 11964 | |
| rainwater | 2863 | 2.5% |
| harvesting | 2863 | 2.5% |
| hand | 1108 | 1.0% |
| dtw | 1108 | 1.0% |
| Other values (4) | 2137 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 85510 | |
| r | 53994 | 8.1% |
| e | 53863 | 8.1% |
| h | 53016 | 7.9% |
| i | 52728 | 7.9% |
| a | 46449 | 7.0% |
| w | 46337 | 6.9% |
| s | 45219 | 6.8% |
| n | 42130 | 6.3% |
| 38933 | 5.8% | |
| Other values (11) | 149010 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 667189 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 85510 | |
| r | 53994 | 8.1% |
| e | 53863 | 8.1% |
| h | 53016 | 7.9% |
| i | 52728 | 7.9% |
| a | 46449 | 7.0% |
| w | 46337 | 6.9% |
| s | 45219 | 6.8% |
| n | 42130 | 6.3% |
| 38933 | 5.8% | |
| Other values (11) | 149010 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 667189 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 85510 | |
| r | 53994 | 8.1% |
| e | 53863 | 8.1% |
| h | 53016 | 7.9% |
| i | 52728 | 7.9% |
| a | 46449 | 7.0% |
| w | 46337 | 6.9% |
| s | 45219 | 6.8% |
| n | 42130 | 6.3% |
| 38933 | 5.8% | |
| Other values (11) | 149010 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 667189 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 85510 | |
| r | 53994 | 8.1% |
| e | 53863 | 8.1% |
| h | 53016 | 7.9% |
| i | 52728 | 7.9% |
| a | 46449 | 7.0% |
| w | 46337 | 6.9% |
| s | 45219 | 6.8% |
| n | 42130 | 6.3% |
| 38933 | 5.8% | |
| Other values (11) | 149010 |
source_type
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| spring | |
|---|---|
| shallow well | |
| borehole | |
| river/lake | |
| rainwater harvesting | |
| Other values (2) | 1187 |
Length
| Max length | 20 |
|---|---|
| Median length | 12 |
| Mean length | 9.3073535 |
| Min length | 3 |
Characters and Unicode
| Total characters | 691071 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | spring |
|---|---|
| 2nd row | rainwater harvesting |
| 3rd row | dam |
| 4th row | borehole |
| 5th row | rainwater harvesting |
Common Values
| Value | Count | Frequency (%) |
| spring | 21216 | |
| shallow well | 21140 | |
| borehole | 14930 | |
| river/lake | 12914 | |
| rainwater harvesting | 2863 | 3.9% |
| dam | 840 | 1.1% |
| other | 347 | 0.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| spring | 21216 | |
| shallow | 21140 | |
| well | 21140 | |
| borehole | 14930 | |
| river/lake | 12914 | |
| rainwater | 2863 | 2.9% |
| harvesting | 2863 | 2.9% |
| dam | 840 | 0.9% |
| other | 347 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 112404 | |
| e | 82901 | |
| r | 70910 | |
| o | 51347 | 7.4% |
| s | 45219 | 6.5% |
| w | 45143 | 6.5% |
| a | 43483 | 6.3% |
| i | 39856 | 5.8% |
| h | 39280 | 5.7% |
| n | 26942 | 3.9% |
| Other values (10) | 133586 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 691071 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 112404 | |
| e | 82901 | |
| r | 70910 | |
| o | 51347 | 7.4% |
| s | 45219 | 6.5% |
| w | 45143 | 6.5% |
| a | 43483 | 6.3% |
| i | 39856 | 5.8% |
| h | 39280 | 5.7% |
| n | 26942 | 3.9% |
| Other values (10) | 133586 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 691071 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 112404 | |
| e | 82901 | |
| r | 70910 | |
| o | 51347 | 7.4% |
| s | 45219 | 6.5% |
| w | 45143 | 6.5% |
| a | 43483 | 6.3% |
| i | 39856 | 5.8% |
| h | 39280 | 5.7% |
| n | 26942 | 3.9% |
| Other values (10) | 133586 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 691071 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 112404 | |
| e | 82901 | |
| r | 70910 | |
| o | 51347 | 7.4% |
| s | 45219 | 6.5% |
| w | 45143 | 6.5% |
| a | 43483 | 6.3% |
| i | 39856 | 5.8% |
| h | 39280 | 5.7% |
| n | 26942 | 3.9% |
| Other values (10) | 133586 |
source_class
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| groundwater | |
|---|---|
| surface | |
| unknown | 347 |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.086114 |
| Min length | 7 |
Characters and Unicode
| Total characters | 748894 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | groundwater |
|---|---|
| 2nd row | surface |
| 3rd row | surface |
| 4th row | groundwater |
| 5th row | surface |
Common Values
| Value | Count | Frequency (%) |
| groundwater | 57286 | |
| surface | 16617 | 22.4% |
| unknown | 347 | 0.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| groundwater | 57286 | |
| surface | 16617 | 22.4% |
| unknown | 347 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 131189 | |
| u | 74250 | |
| a | 73903 | |
| e | 73903 | |
| n | 58327 | |
| o | 57633 | |
| w | 57633 | |
| g | 57286 | |
| d | 57286 | |
| t | 57286 | |
| Other values (4) | 50198 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 748894 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 131189 | |
| u | 74250 | |
| a | 73903 | |
| e | 73903 | |
| n | 58327 | |
| o | 57633 | |
| w | 57633 | |
| g | 57286 | |
| d | 57286 | |
| t | 57286 | |
| Other values (4) | 50198 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 748894 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 131189 | |
| u | 74250 | |
| a | 73903 | |
| e | 73903 | |
| n | 58327 | |
| o | 57633 | |
| w | 57633 | |
| g | 57286 | |
| d | 57286 | |
| t | 57286 | |
| Other values (4) | 50198 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 748894 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 131189 | |
| u | 74250 | |
| a | 73903 | |
| e | 73903 | |
| n | 58327 | |
| o | 57633 | |
| w | 57633 | |
| g | 57286 | |
| d | 57286 | |
| t | 57286 | |
| Other values (4) | 50198 | 6.7% |
waterpoint_type
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| communal standpipe | |
|---|---|
| hand pump | |
| other | |
| communal standpipe multiple | |
| improved spring | 959 |
| Other values (2) | 158 |
Length
| Max length | 27 |
|---|---|
| Median length | 18 |
| Mean length | 14.817051 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1100166 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | communal standpipe |
|---|---|
| 2nd row | communal standpipe |
| 3rd row | communal standpipe multiple |
| 4th row | communal standpipe multiple |
| 5th row | communal standpipe |
Common Values
| Value | Count | Frequency (%) |
| communal standpipe | 35628 | |
| hand pump | 21884 | |
| other | 8010 | 10.8% |
| communal standpipe multiple | 7611 | 10.3% |
| improved spring | 959 | 1.3% |
| cattle trough | 150 | 0.2% |
| dam | 8 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| communal | 43239 | |
| standpipe | 43239 | |
| hand | 21884 | |
| pump | 21884 | |
| other | 8010 | 5.4% |
| multiple | 7611 | 5.1% |
| improved | 959 | 0.6% |
| spring | 959 | 0.6% |
| cattle | 150 | 0.1% |
| trough | 150 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| p | 139775 | |
| m | 116940 | |
| n | 109321 | |
| a | 108520 | |
| 73843 | 6.7% | |
| u | 72884 | 6.6% |
| d | 66090 | 6.0% |
| e | 59969 | 5.5% |
| t | 59310 | 5.4% |
| l | 58611 | 5.3% |
| Other values (8) | 234903 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1100166 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| p | 139775 | |
| m | 116940 | |
| n | 109321 | |
| a | 108520 | |
| 73843 | 6.7% | |
| u | 72884 | 6.6% |
| d | 66090 | 6.0% |
| e | 59969 | 5.5% |
| t | 59310 | 5.4% |
| l | 58611 | 5.3% |
| Other values (8) | 234903 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1100166 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| p | 139775 | |
| m | 116940 | |
| n | 109321 | |
| a | 108520 | |
| 73843 | 6.7% | |
| u | 72884 | 6.6% |
| d | 66090 | 6.0% |
| e | 59969 | 5.5% |
| t | 59310 | 5.4% |
| l | 58611 | 5.3% |
| Other values (8) | 234903 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1100166 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| p | 139775 | |
| m | 116940 | |
| n | 109321 | |
| a | 108520 | |
| 73843 | 6.7% | |
| u | 72884 | 6.6% |
| d | 66090 | 6.0% |
| e | 59969 | 5.5% |
| t | 59310 | 5.4% |
| l | 58611 | 5.3% |
| Other values (8) | 234903 |
waterpoint_type_group
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| communal standpipe | |
|---|---|
| hand pump | |
| other | |
| improved spring | 959 |
| cattle trough | 150 |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 13.894505 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1031667 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | communal standpipe |
|---|---|
| 2nd row | communal standpipe |
| 3rd row | communal standpipe |
| 4th row | communal standpipe |
| 5th row | communal standpipe |
Common Values
| Value | Count | Frequency (%) |
| communal standpipe | 43239 | |
| hand pump | 21884 | |
| other | 8010 | 10.8% |
| improved spring | 959 | 1.3% |
| cattle trough | 150 | 0.2% |
| dam | 8 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| communal | 43239 | |
| standpipe | 43239 | |
| hand | 21884 | |
| pump | 21884 | |
| other | 8010 | 5.7% |
| improved | 959 | 0.7% |
| spring | 959 | 0.7% |
| cattle | 150 | 0.1% |
| trough | 150 | 0.1% |
| dam | 8 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| p | 132164 | |
| m | 109329 | |
| n | 109321 | |
| a | 108520 | |
| 66232 | 6.4% | |
| d | 66090 | 6.4% |
| u | 65273 | 6.3% |
| e | 52358 | 5.1% |
| o | 52358 | 5.1% |
| t | 51699 | 5.0% |
| Other values (8) | 218323 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1031667 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| p | 132164 | |
| m | 109329 | |
| n | 109321 | |
| a | 108520 | |
| 66232 | 6.4% | |
| d | 66090 | 6.4% |
| u | 65273 | 6.3% |
| e | 52358 | 5.1% |
| o | 52358 | 5.1% |
| t | 51699 | 5.0% |
| Other values (8) | 218323 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1031667 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| p | 132164 | |
| m | 109329 | |
| n | 109321 | |
| a | 108520 | |
| 66232 | 6.4% | |
| d | 66090 | 6.4% |
| u | 65273 | 6.3% |
| e | 52358 | 5.1% |
| o | 52358 | 5.1% |
| t | 51699 | 5.0% |
| Other values (8) | 218323 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1031667 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| p | 132164 | |
| m | 109329 | |
| n | 109321 | |
| a | 108520 | |
| 66232 | 6.4% | |
| d | 66090 | 6.4% |
| u | 65273 | 6.3% |
| e | 52358 | 5.1% |
| o | 52358 | 5.1% |
| t | 51699 | 5.0% |
| Other values (8) | 218323 |
| amount_tsh | date_recorded | funder | gps_height | installer | longitude | latitude | wpt_name | num_private | basin | subvillage | region | region_code | district_code | lga | ward | population | public_meeting | recorded_by | scheme_management | scheme_name | permit | construction_year | extraction_type | extraction_type_group | extraction_type_class | management | management_group | payment | payment_type | water_quality | quality_group | quantity | quantity_group | source | source_type | source_class | waterpoint_type | waterpoint_type_group | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| id | |||||||||||||||||||||||||||||||||||||||
| 69572 | 6000.0 | 2011-03-14 | Roman | 1390 | Roman | 34.938093 | -9.856322 | none | 0 | Lake Nyasa | Mnyusi B | Iringa | 11 | 5 | Ludewa | Mundindi | 109 | True | GeoData Consultants Ltd | VWC | Roman | False | 1999 | gravity | gravity | gravity | vwc | user-group | pay annually | annually | soft | good | enough | enough | spring | spring | groundwater | communal standpipe | communal standpipe |
| 8776 | 0.0 | 2013-03-06 | Grumeti | 1399 | GRUMETI | 34.698766 | -2.147466 | Zahanati | 0 | Lake Victoria | Nyamara | Mara | 20 | 2 | Serengeti | Natta | 280 | NaN | GeoData Consultants Ltd | Other | NaN | True | 2010 | gravity | gravity | gravity | wug | user-group | never pay | never pay | soft | good | insufficient | insufficient | rainwater harvesting | rainwater harvesting | surface | communal standpipe | communal standpipe |
| 34310 | 25.0 | 2013-02-25 | Lottery Club | 686 | World vision | 37.460664 | -3.821329 | Kwa Mahundi | 0 | Pangani | Majengo | Manyara | 21 | 4 | Simanjiro | Ngorika | 250 | True | GeoData Consultants Ltd | VWC | Nyumba ya mungu pipe scheme | True | 2009 | gravity | gravity | gravity | vwc | user-group | pay per bucket | per bucket | soft | good | enough | enough | dam | dam | surface | communal standpipe multiple | communal standpipe |
| 67743 | 0.0 | 2013-01-28 | Unicef | 263 | UNICEF | 38.486161 | -11.155298 | Zahanati Ya Nanyumbu | 0 | Ruvuma / Southern Coast | Mahakamani | Mtwara | 90 | 63 | Nanyumbu | Nanyumbu | 58 | True | GeoData Consultants Ltd | VWC | NaN | True | 1986 | submersible | submersible | submersible | vwc | user-group | never pay | never pay | soft | good | dry | dry | machine dbh | borehole | groundwater | communal standpipe multiple | communal standpipe |
| 19728 | 0.0 | 2011-07-13 | Action In A | 0 | Artisan | 31.130847 | -1.825359 | Shuleni | 0 | Lake Victoria | Kyanyamisa | Kagera | 18 | 1 | Karagwe | Nyakasimbi | 0 | True | GeoData Consultants Ltd | NaN | NaN | True | 0 | gravity | gravity | gravity | other | other | never pay | never pay | soft | good | seasonal | seasonal | rainwater harvesting | rainwater harvesting | surface | communal standpipe | communal standpipe |
| 9944 | 20.0 | 2011-03-13 | Mkinga Distric Coun | 0 | DWE | 39.172796 | -4.765587 | Tajiri | 0 | Pangani | Moa/Mwereme | Tanga | 4 | 8 | Mkinga | Moa | 1 | True | GeoData Consultants Ltd | VWC | Zingibali | True | 2009 | submersible | submersible | submersible | vwc | user-group | pay per bucket | per bucket | salty | salty | enough | enough | other | other | unknown | communal standpipe multiple | communal standpipe |
| 19816 | 0.0 | 2012-10-01 | Dwsp | 0 | DWSP | 33.362410 | -3.766365 | Kwa Ngomho | 0 | Internal | Ishinabulandi | Shinyanga | 17 | 3 | Shinyanga Rural | Samuye | 0 | True | GeoData Consultants Ltd | VWC | NaN | True | 0 | swn 80 | swn 80 | handpump | vwc | user-group | never pay | never pay | soft | good | enough | enough | machine dbh | borehole | groundwater | hand pump | hand pump |
| 54551 | 0.0 | 2012-10-09 | Rwssp | 0 | DWE | 32.620617 | -4.226198 | Tushirikiane | 0 | Lake Tanganyika | Nyawishi Center | Shinyanga | 17 | 3 | Kahama | Chambo | 0 | True | GeoData Consultants Ltd | NaN | NaN | True | 0 | nira/tanira | nira/tanira | handpump | wug | user-group | unknown | unknown | milky | milky | enough | enough | shallow well | shallow well | groundwater | hand pump | hand pump |
| 53934 | 0.0 | 2012-11-03 | Wateraid | 0 | Water Aid | 32.711100 | -5.146712 | Kwa Ramadhan Musa | 0 | Lake Tanganyika | Imalauduki | Tabora | 14 | 6 | Tabora Urban | Itetemia | 0 | True | GeoData Consultants Ltd | VWC | NaN | True | 0 | india mark ii | india mark ii | handpump | vwc | user-group | never pay | never pay | salty | salty | seasonal | seasonal | machine dbh | borehole | groundwater | hand pump | hand pump |
| 46144 | 0.0 | 2011-08-03 | Isingiro Ho | 0 | Artisan | 30.626991 | -1.257051 | Kwapeto | 0 | Lake Victoria | Mkonomre | Kagera | 18 | 1 | Karagwe | Kaisho | 0 | True | GeoData Consultants Ltd | NaN | NaN | True | 0 | nira/tanira | nira/tanira | handpump | vwc | user-group | never pay | never pay | soft | good | enough | enough | shallow well | shallow well | groundwater | hand pump | hand pump |
| amount_tsh | date_recorded | funder | gps_height | installer | longitude | latitude | wpt_name | num_private | basin | subvillage | region | region_code | district_code | lga | ward | population | public_meeting | recorded_by | scheme_management | scheme_name | permit | construction_year | extraction_type | extraction_type_group | extraction_type_class | management | management_group | payment | payment_type | water_quality | quality_group | quantity | quantity_group | source | source_type | source_class | waterpoint_type | waterpoint_type_group | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| id | |||||||||||||||||||||||||||||||||||||||
| 59757 | 0.0 | 2013-02-24 | Villagers | 1291 | Villagers | 35.345384 | -9.831170e+00 | Kwa Reonard | 0 | Lake Nyasa | Tuliani | Ruvuma | 10 | 2 | Songea Rural | Wino | 0 | True | GeoData Consultants Ltd | VWC | Mradi wa maji wa wino | True | 2009 | gravity | gravity | gravity | vwc | user-group | never pay | never pay | soft | good | enough | enough | river | river/lake | surface | communal standpipe | communal standpipe |
| 64579 | 0.0 | 2012-10-26 | Dwsp | 0 | DWE | 0.000000 | -2.000000e-08 | Iguna | 0 | Lake Victoria | Nyerere | Shinyanga | 17 | 1 | Bariadi | Kasoli | 0 | NaN | GeoData Consultants Ltd | WUG | NaN | False | 0 | swn 80 | swn 80 | handpump | wug | user-group | unknown | unknown | soft | good | enough | enough | shallow well | shallow well | groundwater | hand pump | hand pump |
| 57731 | 600.0 | 2013-01-27 | Isf | 808 | DWE | 29.740224 | -4.882705e+00 | Hongera | 0 | Lake Tanganyika | Mzizini A | Kigoma | 16 | 3 | Kigoma Rural | Simbo | 230 | True | GeoData Consultants Ltd | WUG | Mkongoro Two | True | 2009 | gravity | gravity | gravity | vwc | user-group | pay monthly | monthly | soft | good | enough | enough | river | river/lake | surface | communal standpipe multiple | communal standpipe |
| 65541 | 0.0 | 2013-02-04 | Oxfarm | 1641 | OXFARM | 29.768139 | -4.480618e+00 | Mwandami | 0 | Lake Tanganyika | Kosoro | Kigoma | 16 | 2 | Kigoma Rural | Mkigo | 1400 | True | GeoData Consultants Ltd | Water authority | NaN | False | 1995 | other | other | other | vwc | user-group | never pay | never pay | soft | good | enough | enough | spring | spring | groundwater | other | other |
| 68174 | 0.0 | 2012-11-07 | Netherlands | 0 | DWE | 34.096878 | -3.079689e+00 | Ikanayugu | 0 | Lake Victoria | Maganju | Shinyanga | 17 | 2 | Maswa | Ipililo | 0 | True | GeoData Consultants Ltd | WUG | NaN | False | 0 | nira/tanira | nira/tanira | handpump | wug | user-group | other | other | soft | good | enough | enough | shallow well | shallow well | groundwater | hand pump | hand pump |
| 39307 | 0.0 | 2011-02-24 | Danida | 34 | Da | 38.852669 | -6.582841e+00 | Kwambwezi | 0 | Wami / Ruvu | Yombo | Pwani | 6 | 1 | Bagamoyo | Yombo | 20 | True | GeoData Consultants Ltd | VWC | Bagamoyo wate | True | 1988 | mono | mono | motorpump | vwc | user-group | never pay | never pay | soft | good | enough | enough | river | river/lake | surface | communal standpipe | communal standpipe |
| 18990 | 1000.0 | 2011-03-21 | Hiap | 0 | HIAP | 37.451633 | -5.350428e+00 | Bonde La Mkondoa | 0 | Pangani | Mkondoa | Tanga | 4 | 7 | Kilindi | Mvungwe | 2960 | True | GeoData Consultants Ltd | VWC | NaN | False | 1994 | nira/tanira | nira/tanira | handpump | vwc | user-group | pay annually | annually | salty | salty | insufficient | insufficient | shallow well | shallow well | groundwater | hand pump | hand pump |
| 28749 | 0.0 | 2013-03-04 | NaN | 1476 | NaN | 34.739804 | -4.585587e+00 | Bwawani | 0 | Internal | Juhudi | Singida | 13 | 2 | Singida Rural | Ughandi | 200 | True | GeoData Consultants Ltd | VWC | NaN | NaN | 2010 | gravity | gravity | gravity | vwc | user-group | never pay | never pay | soft | good | insufficient | insufficient | dam | dam | surface | communal standpipe | communal standpipe |
| 33492 | 0.0 | 2013-02-18 | Germany | 998 | DWE | 35.432732 | -1.058416e+01 | Kwa John | 0 | Lake Nyasa | Namakinga B | Ruvuma | 10 | 2 | Songea Rural | Maposeni | 150 | True | GeoData Consultants Ltd | VWC | Mradi wa maji wa maposeni | True | 2009 | gravity | gravity | gravity | vwc | user-group | never pay | never pay | soft | good | insufficient | insufficient | river | river/lake | surface | communal standpipe | communal standpipe |
| 68707 | 0.0 | 2013-02-13 | Government Of Tanzania | 481 | Government | 34.765054 | -1.122601e+01 | Kwa Mzee Chagala | 0 | Lake Nyasa | Kamba | Ruvuma | 10 | 3 | Mbinga | Mbamba bay | 40 | True | GeoData Consultants Ltd | VWC | DANIDA | True | 2008 | gravity | gravity | gravity | vwc | user-group | never pay | never pay | soft | good | dry | dry | spring | spring | groundwater | communal standpipe | communal standpipe |
Most frequently occurring
| amount_tsh | date_recorded | funder | gps_height | installer | longitude | latitude | wpt_name | num_private | basin | subvillage | region | region_code | district_code | lga | ward | population | public_meeting | recorded_by | scheme_management | scheme_name | permit | construction_year | extraction_type | extraction_type_group | extraction_type_class | management | management_group | payment | payment_type | water_quality | quality_group | quantity | quantity_group | source | source_type | source_class | waterpoint_type | waterpoint_type_group | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 0.0 | 2011-07-18 | Government Of Tanzania | 0 | Government | 0.00000 | -2.000000e-08 | Hospital | 0 | Lake Victoria | Nyanza | Mwanza | 19 | 6 | Geita | Kalangalala | 0 | True | GeoData Consultants Ltd | VWC | Kalangalala | True | 0 | submersible | submersible | submersible | vwc | user-group | never pay | never pay | soft | good | insufficient | insufficient | machine dbh | borehole | groundwater | communal standpipe | communal standpipe | 3 |
| 7 | 0.0 | 2011-07-27 | Hesawa | 0 | DWE | 0.00000 | -2.000000e-08 | Bombani | 0 | Lake Victoria | Kabulabunyasi | Mwanza | 19 | 6 | Geita | Lubanga | 0 | True | GeoData Consultants Ltd | VWC | NaN | True | 0 | nira/tanira | nira/tanira | handpump | vwc | user-group | never pay | never pay | soft | good | insufficient | insufficient | shallow well | shallow well | groundwater | hand pump | hand pump | 3 |
| 0 | 0.0 | 2011-07-13 | He | 0 | HE | 31.61953 | -1.793342e+00 | Kahindu | 0 | Lake Victoria | Ikondoa | Kagera | 18 | 3 | Muleba | Ikondo | 0 | True | GeoData Consultants Ltd | VWC | NaN | True | 0 | gravity | gravity | gravity | vwc | user-group | never pay | never pay | soft | good | enough | enough | spring | spring | groundwater | improved spring | improved spring | 2 |
| 2 | 0.0 | 2011-07-18 | Government Of Tanzania | 0 | Government | 0.00000 | -2.000000e-08 | Nersing College | 0 | Lake Victoria | Nyanza | Mwanza | 19 | 6 | Geita | Kalangalala | 0 | True | GeoData Consultants Ltd | VWC | Borehole | True | 0 | afridev | afridev | handpump | vwc | user-group | never pay | never pay | soft | good | insufficient | insufficient | machine dbh | borehole | groundwater | hand pump | hand pump | 2 |
| 3 | 0.0 | 2011-07-19 | Government Of Tanzania | 0 | Government | 0.00000 | -2.000000e-08 | K/Secondary | 0 | Lake Victoria | Kisese | Mwanza | 19 | 6 | Geita | Kalangalala | 0 | True | GeoData Consultants Ltd | VWC | 14 Kambarage | True | 0 | submersible | submersible | submersible | vwc | user-group | never pay | never pay | soft | good | insufficient | insufficient | machine dbh | borehole | groundwater | communal standpipe | communal standpipe | 2 |
| 4 | 0.0 | 2011-07-19 | Government Of Tanzania | 0 | Government | 0.00000 | -2.000000e-08 | Mulangila | 0 | Lake Victoria | 14Kambalage | Mwanza | 19 | 6 | Geita | Kalangalala | 0 | True | GeoData Consultants Ltd | VWC | 14 Kambarage | True | 0 | submersible | submersible | submersible | vwc | user-group | pay per bucket | per bucket | soft | good | insufficient | insufficient | machine dbh | borehole | groundwater | communal standpipe multiple | communal standpipe | 2 |
| 5 | 0.0 | 2011-07-19 | Plan International | 0 | Plan Internationa | 0.00000 | -2.000000e-08 | Elimu Maalum | 0 | Lake Victoria | Mbugani | Mwanza | 19 | 6 | Geita | Kalangalala | 0 | True | GeoData Consultants Ltd | VWC | Nyankumbu | True | 0 | submersible | submersible | submersible | vwc | user-group | never pay | never pay | soft | good | insufficient | insufficient | machine dbh | borehole | groundwater | communal standpipe | communal standpipe | 2 |
| 6 | 0.0 | 2011-07-26 | Hesawa | 0 | DWE | 0.00000 | -2.000000e-08 | Bombani | 0 | Lake Victoria | Isadukilo | Mwanza | 19 | 6 | Geita | Lubanga | 0 | True | GeoData Consultants Ltd | VWC | NaN | True | 0 | nira/tanira | nira/tanira | handpump | vwc | user-group | never pay | never pay | soft | good | insufficient | insufficient | shallow well | shallow well | groundwater | hand pump | hand pump | 2 |
| 8 | 0.0 | 2011-07-28 | Government Of Tanzania | 0 | Government | 0.00000 | -2.000000e-08 | Mahakama | 0 | Lake Victoria | Nyampa A | Mwanza | 19 | 6 | Geita | Kasamwa | 0 | True | GeoData Consultants Ltd | VWC | Kasamwa | True | 0 | ksb | submersible | submersible | vwc | user-group | unknown | unknown | unknown | unknown | dry | dry | dam | dam | surface | communal standpipe multiple | communal standpipe | 2 |
| 9 | 0.0 | 2011-08-02 | Hesawa | 0 | Hesawa | 0.00000 | -2.000000e-08 | Nyanza | 0 | Lake Victoria | Mjini | Mwanza | 19 | 6 | Geita | Nyang'hwale | 0 | True | GeoData Consultants Ltd | VWC | NaN | True | 0 | other | other | other | vwc | user-group | unknown | unknown | unknown | unknown | dry | dry | shallow well | shallow well | groundwater | other | other | 2 |